Benchmarking emergency department prediction models with machine learning and public electronic health records

Feng Xie; Jun Zhou; Jin Wee Lee; Mingrui Tan; Siqi Li; Logasan S/O Rajnthern; Marcel Lucas Chee; Bibhas Chakraborty; An-Kwok Ian Wong; Alon Dagan; Marcus Eng Hock Ong; Fei Gao; Nan Liu

doi:10.1038/s41597-022-01782-9

Benchmarking emergency department prediction models with machine learning and public electronic health records

Sci Data. 2022 Oct 27;9(1):658. doi: 10.1038/s41597-022-01782-9.

Authors

Feng Xie^#¹, Jun Zhou^#², Jin Wee Lee¹, Mingrui Tan², Siqi Li¹, Logasan S/O Rajnthern³, Marcel Lucas Chee⁴, Bibhas Chakraborty^{1

5

6}, An-Kwok Ian Wong⁷, Alon Dagan^{8

9}, Marcus Eng Hock Ong^{1

10}, Fei Gao², Nan Liu^{11

12

13}

Affiliations

¹ Centre for Quantitative Medicine and Programme in Health Services and Systems Research, Duke-NUS Medical School, Singapore, Singapore.
² Institute of High Performance Computing, Agency for Science, Technology and Research (A*STAR), Singapore, Singapore.
³ School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore, Singapore.
⁴ Faculty of Medicine, Nursing and Health Sciences, Monash University, Victoria, Australia.
⁵ Department of Statistics and Data Science, National University of Singapore, Singapore, Singapore.
⁶ Department of Biostatistics and Bioinformatics, Duke University, Durham, NC, USA.
⁷ Division of Pulmonary, Allergy, and Critical Care Medicine, Duke University, Durham, NC, USA.
⁸ Department of Emergency Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA.
⁹ MIT Critical Data, Laboratory for Computational Physiology, Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA.
¹⁰ Department of Emergency Medicine, Singapore General Hospital, Singapore, Singapore.
¹¹ Centre for Quantitative Medicine and Programme in Health Services and Systems Research, Duke-NUS Medical School, Singapore, Singapore. liu.nan@duke-nus.edu.sg.
¹² SingHealth AI Health Program, Singapore Health Services, Singapore, Singapore. liu.nan@duke-nus.edu.sg.
¹³ Institute of Data Science, National University of Singapore, Singapore, Singapore. liu.nan@duke-nus.edu.sg.

^# Contributed equally.

Abstract

The demand for emergency department (ED) services is increasing across the globe, particularly during the current COVID-19 pandemic. Clinical triage and risk assessment have become increasingly challenging due to the shortage of medical resources and the strain on hospital infrastructure caused by the pandemic. As a result of the widespread use of electronic health records (EHRs), we now have access to a vast amount of clinical data, which allows us to develop prediction models and decision support systems to address these challenges. To date, there is no widely accepted clinical prediction benchmark related to the ED based on large-scale public EHRs. An open-source benchmark data platform would streamline research workflows by eliminating cumbersome data preprocessing, and facilitate comparisons among different studies and methodologies. Based on the Medical Information Mart for Intensive Care IV Emergency Department (MIMIC-IV-ED) database, we created a benchmark dataset and proposed three clinical prediction benchmarks. This study provides future researchers with insights, suggestions, and protocols for managing data and developing predictive tools for emergency care.

MeSH terms

Benchmarking*
COVID-19*
Electronic Health Records
Emergency Service, Hospital
Humans
Machine Learning
Pandemics