A clinical prediction model based on interpretable machine learning algorithms for prolonged hospital stay in acute ischemic stroke patients: a real-world study

Front Endocrinol (Lausanne). 2023 Nov 22:14:1165178. doi: 10.3389/fendo.2023.1165178. eCollection 2023.

Abstract

Objective: Acute ischemic stroke (AIS) brings an increasingly heavier economic burden nowadays. Prolonged length of stay (LOS) is a vital factor in healthcare expenditures. The aim of this study was to predict prolonged LOS in AIS patients based on an interpretable machine learning algorithm.

Methods: We enrolled AIS patients in our hospital from August 2017 to July 2019, and divided them into the "prolonged LOS" group and the "no prolonged LOS" group. Prolonged LOS was defined as hospitalization for more than 7 days. The least absolute shrinkage and selection operator (LASSO) regression was applied to reduce the dimensionality of the data. We compared the predictive capacity of extended LOS in eight different machine learning algorithms. SHapley Additive exPlanations (SHAP) values were used to interpret the outcome, and the most optimal model was assessed by discrimination, calibration, and clinical utility.

Results: Prolonged LOS developed in 149 (22.0%) of the 677 eligible patients. In eight machine learning algorithms, prolonged LOS was best predicted by the Gaussian naive Bayes (GNB) model, which had a striking area under the curve (AUC) of 0.878 ± 0.007 in the training set and 0.857 ± 0.039 in the validation set. The variables sorted by the gap values showed that the strongest predictors were pneumonia, dysphagia, thrombectomy, and stroke severity. High net benefits were observed at 0%-76% threshold probabilities, while good agreement was found between the observed and predicted probabilities.

Conclusions: The model using the GNB algorithm proved excellent for predicting prolonged LOS in AIS patients. This simple model of prolonged hospitalization could help adjust policies and better utilize resources.

Keywords: SHAP (SHapley Additive exPlanations); machine learning; prediction model; prolonged hospital stay; stroke.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Bayes Theorem
  • Humans
  • Ischemic Stroke* / therapy
  • Length of Stay
  • Machine Learning
  • Models, Statistical
  • Prognosis

Grants and funding

This study was supported by: Scientific Research Project of Jiangsu Health Committee (No.H2019054), the Xuzhou Science and Technology Planning Project (No. KC21220) and Science and Technology Development Fund of Affiliated Hospital of Xuzhou Medical University (No.XYFY202250), Shaanxi Provincial Health and Health Research Fund Project (2022E006).