A 3-Gene-Based Diagnostic Signature in Alzheimer's Disease

Eur Neurol. 2022;85(1):6-13. doi: 10.1159/000518727. Epub 2021 Sep 14.

Abstract

Background: Alzheimer's disease (AD) is a chronic neurodegenerative disease. In this study, potential diagnostic biomarkers were identified for AD.

Methods: All AD samples and healthy samples were collected from 2 datasets in the GEO database, in which differentially expressed genes (DEGs) were analyzed by using the limma package of R language. GO and KEGG pathway enrichment was conducted basing on the DEGs via the clusterProfiler package of R. And, the PPI network construction and gene prediction were performed using the STRING database and Cytoscape. Then, a logistic regression model was constructed to predict the sample type.

Results: Bioinformatic analysis of GEO datasets revealed 2,063 and 108 DEGs in GSE5281 and GSE4226 datasets, separately, and 15 overlapping DEGs were found. GO and KEGG enrichment analysis revealed terms associated with neurodevelopment. Then, we built a logistic regression model based on the hub genes from the PPI network and optimized the model to 3 genes (ALDOA, ENC1, and NFKBIA). The values of area under the curve of the training set GSE5281 and testing set GSE4226 were 0.9647 and 0.7857, respectively, which implied the efficacy of this model.

Conclusion: The comprehensive bioinformatic analysis of gene expression in AD patients and the effective logistic regression model built in our study may provide promising research value for diagnostic methods of AD.

Keywords: Alzheimer’s disease; Bioinformatic analysis; Diagnosis; Logistic regression model.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alzheimer Disease* / diagnosis
  • Alzheimer Disease* / genetics
  • Gene Expression Profiling
  • Gene Regulatory Networks / genetics
  • Humans
  • Neurodegenerative Diseases*
  • Protein Interaction Maps / genetics