Machine learning for the prediction of all-cause mortality in patients with sepsis-associated acute kidney injury during hospitalization

Hongshan Zhou; Leping Liu; Qinyu Zhao; Xin Jin; Zhangzhe Peng; Wei Wang; Ling Huang; Yanyun Xie; Hui Xu; Lijian Tao; Xiangcheng Xiao; Wannian Nie; Fang Liu; Li Li; Qiongjing Yuan

doi:10.3389/fimmu.2023.1140755

Front Immunol. 2023 Apr 3:14:1140755. doi: 10.3389/fimmu.2023.1140755. eCollection 2023.

Authors

Hongshan Zhou¹; Leping Liu²; Qinyu Zhao³; Xin Jin⁴; Zhangzhe Peng¹,⁵,⁶; Wei Wang¹,⁵,⁶; Ling Huang¹,⁵,⁶; Yanyun Xie¹,⁵,⁶; Hui Xu¹; Lijian Tao¹,⁵,⁶; Xiangcheng Xiao¹; Wannian Nie¹; Fang Liu⁷; Li Li⁸; Qiongjing Yuan¹,⁵,⁶,⁹

Affiliations

¹ Department of Nephrology, Xiangya Hospital of Central South University, Changsha, Hunan, China.
² Department of Pediatrics, The Third Xiangya Hospital, Central South University, Changsha, China.
³ College of Engineering and Computer Science, Australian National University, Canberra, ACT, Australia.
⁴ Critical Care Medicine, The Third Xiangya Hospital, Central South University, Changsha, Hunan, China.
⁵ Organ Fibrosis Key Lab of Hunan Province, Central South University, Changsha, Hunan, China.
⁶ National International Joint Research Center for Medical Metabolomics, Xiangya Hospital, Central South University, Changsha, Hunan, China.
⁷ Health Management Center, Xiangya Hospital of Central South University, Changsha, Hunan, China.
⁸ Critical Care Medicine, Xiangya Hospital of Central South University, Changsha, Hunan, China.
⁹ National Clinical Medical Research Center for Geriatric Diseases, Xiangya Hospital of Central South University, Changsha, Hunan, China.

Abstract

Background: Sepsis-associated acute kidney injury (S-AKI) is associated with high morbidity and mortality, so a widely accepted model for predicting mortality is urgently needed. This study used machine learning to identify the key variables associated with in-hospital mortality in S-AKI patients and to predict the risk of in-hospital death. We hope that this model can help identify high-risk patients early and allocate medical resources in the intensive care unit (ICU) appropriately.

Methods: A total of 16,154 S-AKI patients from the Medical Information Mart for Intensive Care IV database were divided into a training set (80%) and a validation set (20%). A total of 129 variables were collected, including basic patient information, diagnoses, clinical data, and medication records. We developed and validated machine learning models using 11 different algorithms and selected the best-performing one. Recursive feature elimination was then used to select the key variables. Several indicators were used to compare the predictive performance of the models. The SHapley Additive exPlanations package was applied to interpret the best machine learning model, which was made available to clinicians as a web tool. Finally, we collected clinical data on S-AKI patients from two hospitals for external validation.
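The split and feature-selection steps described above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' pipeline: the abstract does not say how patients were assigned to the two sets, so the seeded random shuffle is an assumption, and `train_validation_split`, `recursive_feature_elimination`, `toy_importance`, and the `var_i` names are hypothetical. Only the cohort size (16,154), the 80/20 ratio, and the variable counts (129 reduced to 15) come from the text.

```python
import random


def train_validation_split(ids, validation_fraction=0.2, seed=0):
    """Shuffle record ids and split them into (training, validation) lists."""
    shuffled = list(ids)
    random.Random(seed).shuffle(shuffled)  # assumption: simple random split
    n_validation = round(len(shuffled) * validation_fraction)
    return shuffled[n_validation:], shuffled[:n_validation]


def recursive_feature_elimination(features, importance_fn, n_keep):
    """Drop the least important feature, one per round, until n_keep remain.

    importance_fn(subset) is a hypothetical callback that should refit a
    model on `subset` and return a {feature: importance} mapping.
    """
    selected = list(features)
    while len(selected) > n_keep:
        importances = importance_fn(selected)
        weakest = min(selected, key=lambda name: importances[name])
        selected.remove(weakest)
    return selected


# Cohort size and variable counts are from the abstract; the ids are placeholders.
train_ids, validation_ids = train_validation_split(range(16154))

feature_names = [f"var_{i}" for i in range(129)]


def toy_importance(subset):
    # Stand-in for refitting a model: importance is just the variable index.
    return {name: int(name.split("_")[1]) for name in subset}


kept = recursive_feature_elimination(feature_names, toy_importance, n_keep=15)
print(len(train_ids), len(validation_ids), len(kept))  # 12923 3231 15
```

In practice `importance_fn` would wrap a real model fit, for example the feature importances of a gradient-boosting classifier refitted on the remaining variables at each round.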

Results: Fifteen key variables were ultimately selected: urine output, maximum blood urea nitrogen, norepinephrine infusion rate, maximum anion gap, maximum creatinine, maximum red blood cell distribution width, minimum international normalized ratio, maximum heart rate, maximum temperature, maximum respiratory rate, minimum fraction of inspired O₂, minimum creatinine, minimum Glasgow Coma Scale score, and diagnoses of diabetes and stroke. The categorical boosting (CatBoost) model presented significantly better predictive performance [area under the receiver operating characteristic (ROC) curve: 0.83] than other models [accuracy (ACC): 75%, Youden index: 50%, sensitivity: 75%, specificity: 75%, F1 score: 0.56, positive predictive value (PPV): 44%, and negative predictive value (NPV): 92%]. The model also performed well in external validation on data from two hospitals in China (area under the ROC curve: 0.75).
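For readers less familiar with these indicators, the sketch below shows how each one is derived from a confusion matrix. The counts are hypothetical: they are not study data and were chosen only so that the resulting rates round to the values reported above. The function name `classification_metrics` is likewise illustrative.

```python
def classification_metrics(tp, fp, tn, fn):
    """Derive common binary-classification indicators from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)  # true positive rate
    specificity = tn / (tn + fp)  # true negative rate
    return {
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
        "sensitivity": sensitivity,
        "specificity": specificity,
        "youden_index": sensitivity + specificity - 1,
        "ppv": tp / (tp + fp),  # positive predictive value (precision)
        "npv": tn / (tn + fn),  # negative predictive value
        "f1": 2 * tp / (2 * tp + fp + fn),
    }


# Hypothetical counts: 100 deaths and 380 survivors (not from the study).
metrics = classification_metrics(tp=75, fp=95, tn=285, fn=25)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}")
```

With sensitivity and specificity both at 75%, a PPV near 44% and an NPV near 92% imply that roughly one in five patients in such a sample died, which is why the PPV is much lower than the NPV.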

Conclusions: After selection of 15 key variables, a machine learning-based model for predicting mortality in S-AKI patients was successfully established, and the CatBoost model demonstrated the best predictive performance.

Keywords: acute kidney injury; machine learning; mortality; predictive model; sepsis.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Acute Kidney Injury* / diagnosis
Acute Kidney Injury* / etiology
Creatinine
Hospitalization
Humans
Machine Learning
Sepsis* / complications

Substances

Creatinine

Grants and funding

This work was supported by the Natural Science Foundation of Hunan Province, China (Grant Nos. 2020JJ5942, 2019JJ40515, and 2019JJ20035), the Major Program of the National Natural Science Foundation of China (Grant No. 82090024), the General Program of the National Natural Science Foundation of China (Grant No. 82173877), and the Key Research and Development Program of Hunan Province (Grant No. 2021SK2015).