The Dependence of Machine Learning on Electronic Medical Record Quality

Long V Ho; David Ledbetter; Melissa Aczon; Randall Wetzel

The Dependence of Machine Learning on Electronic Medical Record Quality

AMIA Annu Symp Proc. 2018 Apr 16:2017:883-891. eCollection 2017.

Authors

Long V Ho¹, David Ledbetter¹, Melissa Aczon¹, Randall Wetzel¹

Affiliation

¹ The Laura P. and Leland K. Whittier Virtual Pediatric Intensive Care Unit Children's Hospital Los Angeles, Los Angeles, CA.

PMID: 29854155
PMCID: PMC5977633

Abstract

There is growing interest in applying machine learning methods to Electronic Medical Records (EMR). Across different institutions, however, EMR quality can vary widely. This work investigated the impact of this disparity on the performance of three advanced machine learning algorithms: logistic regression, multilayer perceptron, and recurrent neural network. The EMR disparity was emulated using different permutations of the EMR collected at Children's Hospital Los Angeles (CHLA) Pediatric Intensive Care Unit (PICU) and Cardiothoracic Intensive Care Unit (CTICU). The algorithms were trained using patients from the PICU to predict in-ICU mortality for patients on a held out set of PICU and CTICU patients. The disparate patient populations between the PICU and CTICU provide an estimate of generalization errors across different ICUs. We quantified and evaluated the generalization of these algorithms on varying EMR size, input types, and fidelity of data.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Adolescent
Algorithms
Child
Child, Preschool
Coronary Care Units
Data Accuracy*
Electronic Health Records* / standards
Hospital Mortality
Hospitals, Pediatric
Humans
Infant
Intensive Care Units, Pediatric*
Logistic Models
Los Angeles
Machine Learning*