Developing machine learning models to predict multi-class functional outcomes and death three months after stroke in Sweden

Josline Adhiambo Otieno; Jenny Häggström; David Darehed; Marie Eriksson

doi:10.1371/journal.pone.0303287

Developing machine learning models to predict multi-class functional outcomes and death three months after stroke in Sweden

PLoS One. 2024 May 13;19(5):e0303287. doi: 10.1371/journal.pone.0303287. eCollection 2024.

Authors

Josline Adhiambo Otieno¹, Jenny Häggström¹, David Darehed², Marie Eriksson¹

Affiliations

¹ Department of Statistics, USBE, Umeå University, Umeå, Sweden.
² Department of Public Health and Clinical Medicine, Sunderby Research Unit, Umeå University, Umeå, Sweden.

Abstract

Globally, stroke is the third-leading cause of mortality and disability combined, and one of the costliest diseases in society. More accurate predictions of stroke outcomes can guide healthcare organizations in allocating appropriate resources to improve care and reduce both the economic and social burden of the disease. We aim to develop and evaluate the performance and explainability of three supervised machine learning models and the traditional multinomial logistic regression (mLR) in predicting functional dependence and death three months after stroke, using routinely-collected data. This prognostic study included adult patients, registered in the Swedish Stroke Registry (Riksstroke) from 2015 to 2020. Riksstroke contains information on stroke care and outcomes among patients treated in hospitals in Sweden. Prognostic factors (features) included demographic characteristics, pre-stroke functional status, cardiovascular risk factors, medications, acute care, stroke type, and severity. The outcome was measured using the modified Rankin Scale at three months after stroke (a scale of 0-2 indicates independent, 3-5 dependent, and 6 dead). Outcome prediction models included support vector machines, artificial neural networks (ANN), eXtreme Gradient Boosting (XGBoost), and mLR. The models were trained and evaluated on 75% and 25% of the dataset, respectively. Model predictions were explained using SHAP values. The study included 102,135 patients (85.8% ischemic stroke, 53.3% male, mean age 75.8 years, and median NIHSS of 3). All models demonstrated similar overall accuracy (69%-70%). The ANN and XGBoost models performed significantly better than the mLR in classifying dependence with F1-scores of 0.603 (95% CI; 0.594-0.611) and 0.577 (95% CI; 0.568-0.586), versus 0.544 (95% CI; 0.545-0.563) for the mLR model. The factors that contributed most to the predictions were expectedly similar in the models, based on clinical knowledge. Our ANN and XGBoost models showed a modest improvement in prediction performance and explainability compared to mLR using routinely-collected data. Their improved ability to predict functional dependence may be of particular importance for the planning and organization of acute stroke care and rehabilitation.

Copyright: © 2024 Otieno et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Aged
Aged, 80 and over
Female
Humans
Logistic Models
Machine Learning*
Male
Middle Aged
Neural Networks, Computer
Prognosis
Registries
Risk Factors
Stroke* / physiopathology
Support Vector Machine
Sweden / epidemiology

Grants and funding

The author(s) received no specific funding for this work.