Analysis of Risk Factors in Dementia Through Machine Learning

J Alzheimers Dis. 2021;79(2):845-861. doi: 10.3233/JAD-200955.

Abstract

Background: Sociodemographic data indicate the progressive increase in life expectancy and the prevalence of Alzheimer's disease (AD). AD is raised as one of the greatest public health problems. Its etiology is twofold: on the one hand, non-modifiable factors and on the other, modifiable.

Objective: This study aims to develop a processing framework based on machine learning (ML) and optimization algorithms to study sociodemographic, clinical, and analytical variables, selecting the best combination among them for an accurate discrimination between controls and subjects with major neurocognitive disorder (MNCD).

Methods: This research is based on an observational-analytical design. Two research groups were established: MNCD group (n = 46) and control group (n = 38). ML and optimization algorithms were employed to automatically diagnose MNCD.

Results: Twelve out of 37 variables were identified in the validation set as the most relevant for MNCD diagnosis. Sensitivity of 100%and specificity of 71%were achieved using a Random Forest classifier.

Conclusion: ML is a potential tool for automatic prediction of MNCD which can be applied to relatively small preclinical and clinical data sets. These results can be interpreted to support the influence of the environment on the development of AD.

Keywords: Alzheimer’s disease; machine learning; neurocognitive disorders; risk factors.

Publication types

  • Observational Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Aged
  • Aged, 80 and over
  • Algorithms
  • Alzheimer Disease / diagnosis
  • Alzheimer Disease / etiology*
  • Case-Control Studies
  • Cognitive Reserve
  • Depression / complications
  • Diabetes Mellitus, Type 2 / complications
  • Exercise
  • Female
  • Humans
  • Hypertension / complications
  • Machine Learning*
  • Male
  • Neurocognitive Disorders / diagnosis
  • Neurocognitive Disorders / etiology
  • Risk Factors
  • Sensitivity and Specificity
  • Socioeconomic Factors
  • Tobacco Use / adverse effects