Survival Analysis in Cognitively Normal Subjects and in Patients with Mild Cognitive Impairment Using a Proportional Hazards Model with Extreme Gradient Boosting Regression

J Alzheimers Dis. 2022;85(2):837-850. doi: 10.3233/JAD-215266.

Abstract

Background: Evaluating the risk of Alzheimer's disease (AD) in cognitively normal (CN) and patients with mild cognitive impairment (MCI) is extremely important. While MCI-to-AD progression risk has been studied extensively, few studies estimate CN-to-MCI conversion risk. The Cox proportional hazards (PH), a widely used survival analysis model, assumes a linear predictor-risk relationship. Generalizing the PH model to more complex predictor-risk relationships may increase risk estimation accuracy.

Objective: The aim of this study was to develop a PH model using an Xgboost regressor, based on demographic, genetic, neuropsychiatric, and neuroimaging predictors to estimate risk of AD in patients with MCI, and the risk of MCI in CN subjects.

Methods: We replaced the Cox PH linear model with an Xgboost regressor to capture complex interactions between predictors, and non-linear predictor-risk associations. We endeavored to limit model inputs to noninvasive and more widely available predictors in order to facilitate future applicability in a wider setting.

Results: In MCI-to-AD (n = 882), the Xgboost model achieved a concordance index (C-index) of 84.5%. When the model was used for MCI risk prediction in CN (n = 100) individuals, the C-index was 73.3%. In both applications, the C-index was statistically significantly higher in the Xgboost in comparison to the Cox PH model.

Conclusion: Using non-linear regressors such as Xgboost improves AD dementia risk assessment in CN and MCI. It is possible to achieve reasonable risk stratification using predictors that are relatively low-cost in terms of time, invasiveness, and availability. Future strategies for improving AD dementia risk estimation are discussed.

Keywords: Alzheimer’s disease; Xgboost; brain; hippocampal atrophy; machine learning; magnetic resonance imaging; mild cognitive impairment; proportional hazards model; survival analysis.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Aged
  • Aged, 80 and over
  • Alzheimer Disease / diagnosis*
  • Alzheimer Disease / epidemiology
  • Alzheimer Disease / genetics
  • Cognitive Dysfunction / diagnosis*
  • Cognitive Dysfunction / epidemiology
  • Cognitive Dysfunction / genetics
  • Disease Progression
  • Female
  • Genetic Testing / methods
  • Humans
  • Magnetic Resonance Imaging
  • Male
  • Neuropsychological Tests
  • Prognosis
  • Proportional Hazards Models*
  • Risk Assessment / methods
  • Survival Analysis