Explainable artificial intelligence model for mortality risk prediction in the intensive care unit: a derivation and validation study

Chang Hu; Chao Gao; Tianlong Li; Chang Liu; Zhiyong Peng

doi:10.1093/postmj/qgad144

Explainable artificial intelligence model for mortality risk prediction in the intensive care unit: a derivation and validation study

Postgrad Med J. 2024 Mar 18;100(1182):219-227. doi: 10.1093/postmj/qgad144.

Authors

Chang Hu^{1

2}, Chao Gao^{1

2}, Tianlong Li^{1

2}, Chang Liu^{1

2}, Zhiyong Peng^{1

2}

Affiliations

¹ Department of Critical Care Medicine, Zhongnan Hospital of Wuhan University, Wuhan 430071, Hubei, China.
² Clinical Research Center of Hubei Critical Care Medicine, Wuhan 430071, Hubei, China.

PMID: 38244550
DOI: 10.1093/postmj/qgad144

Abstract

Background: The lack of transparency is a prevalent issue among the current machine-learning (ML) algorithms utilized for predicting mortality risk. Herein, we aimed to improve transparency by utilizing the latest ML explicable technology, SHapley Additive exPlanation (SHAP), to develop a predictive model for critically ill patients.

Methods: We extracted data from the Medical Information Mart for Intensive Care IV database, encompassing all intensive care unit admissions. We employed nine different methods to develop the models. The most accurate model, with the highest area under the receiver operating characteristic curve, was selected as the optimal model. Additionally, we used SHAP to explain the workings of the ML model.

Results: The study included 21 395 critically ill patients, with a median age of 68 years (interquartile range, 56-79 years), and most patients were male (56.9%). The cohort was randomly split into a training set (N = 16 046) and a validation set (N = 5349). Among the nine models developed, the Random Forest model had the highest accuracy (87.62%) and the best area under the receiver operating characteristic curve value (0.89). The SHAP summary analysis showed that Glasgow Coma Scale, urine output, and blood urea nitrogen were the top three risk factors for outcome prediction. Furthermore, SHAP dependency analysis and SHAP force analysis were used to interpret the Random Forest model at the factor level and individual level, respectively.

Conclusion: A transparent ML model for predicting outcomes in critically ill patients using SHAP methodology is feasible and effective. SHAP values significantly improve the explainability of ML models.

Keywords: critical illness; explainable artificial intelligence; machine learning; mortality.

MeSH terms

Aged
Algorithms
Artificial Intelligence*
Critical Care
Critical Illness* / therapy
Female
Humans
Intensive Care Units
Male
Middle Aged

Abstract

MeSH terms

Grants and funding