Comparing traditional modeling approaches versus predictive analytics methods for predicting multiple sclerosis relapse

Mult Scler Relat Disord. 2022 Jan:57:103330. doi: 10.1016/j.msard.2021.103330. Epub 2021 Oct 17.

Abstract

Objective: This study compared traditional statistical methods to different predictive analytics methods on the endpoint of multiple sclerosis (MS) relapse.

Study setting: This is a secondary data analysis on four different MS Centers based on the third year of data, July 2019-June 2020.

Study design: The parent study is a two-part, 3-year clinical quality improvement prospective study that started in June 2017 and concluded in June 2020, and utilizes a prospective stepped-wedge randomized design. Binary logistic regression was compared with other machine learning models, specifically ridge, least absolute shrinkage and selection operator (LASSO), and random forest.

Data collection: This study used electronic health record data extracted at the individual level and 'rolled up' to the system and population level. Inclusion criteria included participants aged 18 years or older, with MS presenting to any of the four centers, who entered the study in any quarter. Exclusion criteria included cases with missing or incorrectly input data and those who refused to participate in the study.

Principal findings: When comparing relapse indices across models, random forest significantly outperformed logistic regression and other machine learning algorithms (ΔperfA =27.1%, ΔperfM =27.5%). However, for ΔperfF, logistic regression and random forest performed relatively the same. Ridge and LASSO outperformed logistic regression (ΔperfM1 =0.9%, ΔperfM2 =9.4%, ΔperfF2=25.8%, respectively).

Conclusion: Multiple sclerosis is a complex and costly chronic ("3C") condition that currently has no cure. In a condition like MS, which has an unpredictable course, the use of predictive analytics could help health systems learn better, faster, and to improve more effectively and predict rather than react to emerging health needs for people with MS. Comparing the predictability of relapse across various models with a predictive analytics framework can potentially change how we manage MS care.

Keywords: Model comparison; Multiple sclerosis; Predictive analytics; Quality improvement; Relapse; Statistical modeling.

Publication types

  • Randomized Controlled Trial

MeSH terms

  • Adolescent
  • Humans
  • Logistic Models
  • Machine Learning
  • Multiple Sclerosis* / diagnosis
  • Multiple Sclerosis* / epidemiology
  • Multiple Sclerosis* / therapy
  • Prospective Studies
  • Recurrence