Predicting sepsis in-hospital mortality with machine learning: a multi-center study using clinical and inflammatory biomarkers

Guyu Zhang; Fei Shao; Wei Yuan; Junyuan Wu; Xuan Qi; Jie Gao; Rui Shao; Ziren Tang; Tao Wang

doi:10.1186/s40001-024-01756-0

Predicting sepsis in-hospital mortality with machine learning: a multi-center study using clinical and inflammatory biomarkers

Eur J Med Res. 2024 Mar 6;29(1):156. doi: 10.1186/s40001-024-01756-0.

Authors

Guyu Zhang¹, Fei Shao¹, Wei Yuan¹, Junyuan Wu¹, Xuan Qi¹, Jie Gao¹, Rui Shao¹, Ziren Tang^#², Tao Wang^#³

Affiliations

¹ Emergency Medicine Clinical Research Center, Beijing Chaoyang Hospital, Capital Medical University, Beijing Key Laboratory of Cardiopulmonary Cerebral Resuscitation, Beijing, 100020, China.
² Emergency Medicine Clinical Research Center, Beijing Chaoyang Hospital, Capital Medical University, Beijing Key Laboratory of Cardiopulmonary Cerebral Resuscitation, Beijing, 100020, China. TangZiren1970@126.com.
³ Emergency Medicine Clinical Research Center, Beijing Chaoyang Hospital, Capital Medical University, Beijing Key Laboratory of Cardiopulmonary Cerebral Resuscitation, Beijing, 100020, China. wangtao19780117@sina.com.

^# Contributed equally.

Abstract

Background: This study aimed to develop and validate an interpretable machine-learning model that utilizes clinical features and inflammatory biomarkers to predict the risk of in-hospital mortality in critically ill patients suffering from sepsis.

Methods: We enrolled all patients diagnosed with sepsis in the Medical Information Mart for Intensive Care IV (MIMIC-IV, v.2.0), eICU Collaborative Research Care (eICU-CRD 2.0), and the Amsterdam University Medical Centers databases (AmsterdamUMCdb 1.0.2). LASSO regression was employed for feature selection. Seven machine-learning methods were applied to develop prognostic models. The optimal model was chosen based on its accuracy, F1 score and area under curve (AUC) in the validation cohort. Moreover, we utilized the SHapley Additive exPlanations (SHAP) method to elucidate the effects of the features attributed to the model and analyze how individual features affect the model's output. Finally, Spearman correlation analysis examined the associations among continuous predictor variables. Restricted cubic splines (RCS) explored potential non-linear relationships between continuous risk factors and in-hospital mortality.

Results: 3535 patients with sepsis were eligible for participation in this study. The median age of the participants was 66 years (IQR, 55-77 years), and 56% were male. After selection, 12 of the 45 clinical parameters collected on the first day after ICU admission remained associated with prognosis and were used to develop machine-learning models. Among seven constructed models, the eXtreme Gradient Boosting (XGBoost) model achieved the best performance, with an AUC of 0.94 and an F1 score of 0.937 in the validation cohort. Feature importance analysis revealed that Age, AST, invasive ventilation treatment, and serum urea nitrogen (BUN) were the top four features of the XGBoost model with the most significant impact. Inflammatory biomarkers may have prognostic value. Furthermore, SHAP force analysis illustrated how the constructed model visualized the prediction of the model.

Conclusions: This study demonstrated the potential of machine-learning approaches for early prediction of outcomes in patients with sepsis. The SHAP method could improve the interoperability of machine-learning models and help clinicians better understand the reasoning behind the outcome.

Keywords: Intensive care unit; Machining learning; Prediction; Sepsis; XGBoost.

Publication types

Multicenter Study

MeSH terms

Aged
Area Under Curve
Biomarkers
Female
Hospital Mortality
Humans
Machine Learning
Male
Middle Aged
Sepsis*

Substances

Biomarkers