Working Towards a Blood-Derived Gene Expression Biomarker Specific for Alzheimer's Disease

J Alzheimers Dis. 2020;74(2):545-561. doi: 10.3233/JAD-191163.

Abstract

Background: The typical approach to identify blood-derived gene expression signatures as a biomarker for Alzheimer's disease (AD) have relied on training classification models using AD and healthy controls only. This may inadvertently result in the identification of markers for general illness rather than being disease-specific.

Objective: Investigate whether incorporating additional related disorders in the classification model development process can lead to the discovery of an AD-specific gene expression signature.

Methods: Two types of XGBoost classification models were developed. The first used 160 AD and 127 healthy controls and the second used the same 160 AD with 6,318 upsampled mixed controls consisting of Parkinson's disease, multiple sclerosis, amyotrophic lateral sclerosis, bipolar disorder, schizophrenia, coronary artery disease, rheumatoid arthritis, chronic obstructive pulmonary disease, and cognitively healthy subjects. Both classification models were evaluated in an independent cohort consisting of 127 AD and 687 mixed controls.

Results: The AD versus healthy control models resulted in an average 48.7% sensitivity (95% CI = 34.7-64.6), 41.9% specificity (95% CI = 26.8-54.3), 13.6% PPV (95% CI = 9.9-18.5), and 81.1% NPV (95% CI = 73.3-87.7). In contrast, the mixed control models resulted in an average of 40.8% sensitivity (95% CI = 27.5-52.0), 95.3% specificity (95% CI = 93.3-97.1), 61.4% PPV (95% CI = 53.8-69.6), and 89.7% NPV (95% CI = 87.8-91.4).

Conclusions: This early work demonstrates the value of incorporating additional related disorders into the classification model developmental process, which can result in models with improved ability to distinguish AD from a heterogeneous aging population. However, further improvement to the sensitivity of the test is still required.

Keywords: Age-related memory disorders; Alzheimer’s disease; biomarkers; dementia; gene expression; human; machine learning; microarray analysis; neurodegenerative disorders.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alzheimer Disease / blood*
  • Alzheimer Disease / diagnosis
  • Alzheimer Disease / genetics*
  • Biomarkers / blood
  • Databases, Genetic / trends*
  • Gene Expression
  • Humans
  • Protein Array Analysis / methods
  • Protein Array Analysis / trends*

Substances

  • Biomarkers