Sick patients have more data: the non-random completeness of electronic health records

AMIA Annu Symp Proc. 2013 Nov 16:2013:1472-7. eCollection 2013.

Abstract

As interest in the reuse of electronic health record (EHR) data for research purposes grows, so too does awareness of the significant data quality problems in these non-traditional datasets. In the past, however, little attention has been paid to whether poor data quality merely introduces noise into EHR-derived datasets, or if there is potential for the creation of spurious signals and bias. In this study we use EHR data to demonstrate a statistically significant relationship between EHR completeness and patient health status, indicating that records with more data are likely to be more representative of sick patients than healthy ones, and therefore may not reflect the broader population found within the EHR.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Adult
  • Aged
  • Analysis of Variance
  • Anesthesiology
  • Electronic Health Records* / standards
  • Electronic Health Records* / statistics & numerical data
  • Female
  • Health Status*
  • Humans
  • Male
  • Middle Aged
  • Patients
  • Research Design
  • Societies, Medical
  • United States