Using a machine learning approach to predict mortality in critically ill influenza patients: a cross-sectional retrospective multicentre study in Taiwan

BMJ Open. 2020 Feb 25;10(2):e033898. doi: 10.1136/bmjopen-2019-033898.

Abstract

Objectives: Current mortality prediction models used in the intensive care unit (ICU) have a limited role for specific diseases such as influenza, and we aimed to establish an explainable machine learning (ML) model for predicting mortality in critically ill influenza patients using a real-world severe influenza data set.

Study design: A cross-sectional retrospective multicentre study in Taiwan SETTING: Eight medical centres in Taiwan.

Participants: A total of 336 patients requiring ICU-admission for virology-proven influenza at eight hospitals during an influenza epidemic between October 2015 and March 2016.

Primary and secondary outcome measures: We employed extreme gradient boosting (XGBoost) to establish the prediction model, compared the performance with logistic regression (LR) and random forest (RF), demonstrated the feature importance categorised by clinical domains, and used SHapley Additive exPlanations (SHAP) for visualised interpretation.

Results: The data set contained 76 features of the 336 patients with severe influenza. The severity was apparently high, as shown by the high Acute Physiology and Chronic Health Evaluation II score (22, 17 to 29) and pneumonia severity index score (118, 88 to 151). XGBoost model (area under the curve (AUC): 0.842; 95% CI 0.749 to 0.928) outperformed RF (AUC: 0.809; 95% CI 0.629 to 0.891) and LR (AUC: 0.701; 95% CI 0.573 to 0.825) for predicting 30-day mortality. To give clinicians an intuitive understanding of feature exploitation, we stratified features by the clinical domain. The cumulative feature importance in the fluid balance domain, ventilation domain, laboratory data domain, demographic and symptom domain, management domain and severity score domain was 0.253, 0.113, 0.177, 0.140, 0.152 and 0.165, respectively. We further used SHAP plots to illustrate associations between features and 30-day mortality in critically ill influenza patients.

Conclusions: We used a real-world data set and applied an ML approach, mainly XGBoost, to establish a practical and explainable mortality prediction model in critically ill influenza patients.

Keywords: adult intensive & critical care; infectious diseases & infestations; information technology; thoracic medicine.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Critical Illness / mortality*
  • Cross-Sectional Studies
  • Female
  • Hospital Mortality
  • Humans
  • Influenza, Human / mortality*
  • Machine Learning*
  • Male
  • Middle Aged
  • Models, Theoretical
  • Outcome Assessment, Health Care
  • Retrospective Studies
  • Taiwan