Bioinformatics and machine learning methodologies to identify the effects of central nervous system disorders on glioblastoma progression

Brief Bioinform. 2021 Sep 2;22(5):bbaa365. doi: 10.1093/bib/bbaa365.

Abstract

Glioblastoma (GBM) is a common malignant brain tumor which often presents as a comorbidity with central nervous system (CNS) disorders. Both CNS disorders and GBM cells release glutamate and show an abnormality, but differ in cellular behavior. So, their etiology is not well understood, nor is it clear how CNS disorders influence GBM behavior or growth. This led us to employ a quantitative analytical framework to unravel shared differentially expressed genes (DEGs) and cell signaling pathways that could link CNS disorders and GBM using datasets acquired from the Gene Expression Omnibus database (GEO) and The Cancer Genome Atlas (TCGA) datasets where normal tissue and disease-affected tissue were examined. After identifying DEGs, we identified disease-gene association networks and signaling pathways and performed gene ontology (GO) analyses as well as hub protein identifications to predict the roles of these DEGs. We expanded our study to determine the significant genes that may play a role in GBM progression and the survival of the GBM patients by exploiting clinical and genetic factors using the Cox Proportional Hazard Model and the Kaplan-Meier estimator. In this study, 177 DEGs with 129 upregulated and 48 downregulated genes were identified. Our findings indicate new ways that CNS disorders may influence the incidence of GBM progression, growth or establishment and may also function as biomarkers for GBM prognosis and potential targets for therapies. Our comparison with gold standard databases also provides further proof to support the connection of our identified biomarkers in the pathology underlying the GBM progression.

Keywords: bioinformatics; central nervous system disorders; comorbidity; glioblastoma; machine learning; ontology; pathway; proteins; survival analysis.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Atlases as Topic
  • Brain Neoplasms / genetics*
  • Brain Neoplasms / metabolism
  • Brain Neoplasms / mortality
  • Brain Neoplasms / pathology
  • Central Nervous System / metabolism*
  • Central Nervous System / pathology
  • Computational Biology / methods
  • Datasets as Topic
  • Gene Expression Profiling
  • Gene Expression Regulation, Neoplastic
  • Gene Ontology
  • Gene Regulatory Networks*
  • Glioblastoma / genetics*
  • Glioblastoma / metabolism
  • Glioblastoma / mortality
  • Glioblastoma / pathology
  • Glutamic Acid / metabolism
  • Humans
  • Kaplan-Meier Estimate
  • Machine Learning*
  • Molecular Sequence Annotation
  • Neoplasm Proteins / classification
  • Neoplasm Proteins / genetics*
  • Neoplasm Proteins / metabolism
  • Proportional Hazards Models
  • Signal Transduction

Substances

  • Neoplasm Proteins
  • Glutamic Acid