Application of Machine Learning Techniques in the HELIAD Study Data for the Development of Diagnostic Models in MCI and Dementia

Adv Exp Med Biol. 2023:1424:187-192. doi: 10.1007/978-3-031-31982-2_20.

Abstract

The increase in the population's life expectancy leads to an increase in the incidence of dementia and, therefore, in diseases such as Alzheimer's. Towards this direction, the HELIAD1 study is the first large-scale epidemiological study aimed at assessing epidemiological data on dementia, mild mental decline, and other neuropsychiatric disorders associated with old age. This is a huge study with several computational challenges, most of which can be addressed by machine learning processes. The objectives of this study were to detect patterns in the HELIAD clinical data that classify with high accuracy various levels of cognitive impairment by training ML algorithms and hence apply derived model on future clinical data to predict with the same accuracy the class variable. We propose a machine learning method based on RUSBoost classifier to identify a critical subset of biomarkers that classify accurately between neurological patients with mild cognitive impairment (MCI) or dementia of the Alzheimer's type (DAT) and the cognitively healthy control (CHC) group. In this study we used a highly skewed (imbalanced) dataset with most observations (majority class) belonging to the CHC group. The method proposed predicts accurately the clinical diagnosis label and effectively classifies the neurological patients from the CHC class. In particular, the classification accuracy (actual vs predicted) for the three classes of the clinical diagnosis was 97%, 78%, and 91% for control, MCI, and dementia class, respectively.

Keywords: Alzheimer’s disease; Cognitive impairment; Dementia; HELIAD study; Lifestyle; Machine learning.

MeSH terms

  • Alzheimer Disease* / complications
  • Alzheimer Disease* / diagnosis
  • Alzheimer Disease* / epidemiology
  • Biomarkers
  • Cognitive Dysfunction* / complications
  • Cognitive Dysfunction* / diagnosis
  • Cognitive Dysfunction* / epidemiology
  • Disease Progression
  • Humans
  • Machine Learning
  • Sensitivity and Specificity

Substances

  • Biomarkers