Training machine learning models to predict 30-day mortality in patients discharged from the emergency department: a retrospective, population-based registry study

Mathias Carl Blom; Awais Ashfaq; Anita Sant'Anna; Philip D Anderson; Markus Lingman

doi:10.1136/bmjopen-2018-028015

Training machine learning models to predict 30-day mortality in patients discharged from the emergency department: a retrospective, population-based registry study

BMJ Open. 2019 Aug 10;9(8):e028015. doi: 10.1136/bmjopen-2018-028015.

Authors

Mathias Carl Blom¹, Awais Ashfaq^{2

3}, Anita Sant'Anna², Philip D Anderson^{4

5}, Markus Lingman^{6

7}

Affiliations

¹ Department of Clinical Sciences Lund, Medicine, Lund University, Medical Faculty, Lund, Sweden.
² Center for Applied Intelligent Systems Research (CAISR), Halmstad University, Halmstad, Sweden.
³ Halland Hospital, Region Halland, Halmstad, Sweden.
⁴ Department of Emergency Medicine, Brigham and Women's Hospital, Boston, Massachusetts, USA.
⁵ Harvard Medical School, Boston, Massachusetts, USA.
⁶ Halland Hospital, Region Halland, Halmstad, Sweden Markus.Lingman@regionhalland.se.
⁷ Department of Molecular and Clinical Medicine/Cardiology, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden.

Abstract

Objectives: The aim of this work was to train machine learning models to identify patients at end of life with clinically meaningful diagnostic accuracy, using 30-day mortality in patients discharged from the emergency department (ED) as a proxy.

Design: Retrospective, population-based registry study.

Setting: Swedish health services.

Primary and secondary outcome measures: All cause 30-day mortality.

Methods: Electronic health records (EHRs) and administrative data were used to train six supervised machine learning models to predict all-cause mortality within 30 days in patients discharged from EDs in southern Sweden, Europe.

Participants: The models were trained using 65 776 ED visits and validated on 55 164 visits from a separate ED to which the models were not exposed during training.

Results: The outcome occurred in 136 visits (0.21%) in the development set and in 83 visits (0.15%) in the validation set. The model with highest discrimination attained ROC-AUC 0.95 (95% CI 0.93 to 0.96), with sensitivity 0.87 (95% CI 0.80 to 0.93) and specificity 0.86 (0.86 to 0.86) on the validation set.

Conclusions: Multiple models displayed excellent discrimination on the validation set and outperformed available indexes for short-term mortality prediction interms of ROC-AUC (by indirect comparison). The practical utility of the models increases as the data they were trained on did not require costly de novo collection but were real-world data generated as a by-product of routine care delivery.

Keywords: advance care planning; emergency medicine; machine learning; mortality.

Publication types

Research Support, Non-U.S. Gov't
Validation Study

MeSH terms

Adult
Aged
Aged, 80 and over
Electronic Health Records*
Emergency Service, Hospital / statistics & numerical data*
Female
Humans
Logistic Models
Machine Learning*
Male
Middle Aged
Mortality*
Patient Discharge / statistics & numerical data*
Registries
Retrospective Studies
Sweden / epidemiology
Time Factors
Young Adult