Addressing Bias in Electronic Health Record-Based Surveillance of Cardiovascular Disease Risk: Finding the Signal Through the Noise

Julie K Bower; Sejal Patel; Joyce E Rudy; Ashley S Felix

doi:10.1007/s40471-017-0130-z

Curr Epidemiol Rep. 2017 Dec;4(4):346-352. doi: 10.1007/s40471-017-0130-z. Epub 2017 Nov 2.

Authors

Julie K Bower¹,², Sejal Patel¹, Joyce E Rudy¹, Ashley S Felix¹

Affiliations

¹ Division of Epidemiology, College of Public Health, The Ohio State University, Columbus, OH.
² Division of Cardiovascular Medicine, The Ohio State University College of Medicine, Columbus, OH.

Abstract

Purpose of review: Use of the electronic health record (EHR) for cardiovascular disease (CVD) surveillance is increasingly common. However, these data can introduce systematic error that affects the internal and external validity of study findings. We reviewed recent literature on EHR-based studies of CVD risk to summarize the most common types of bias that arise. We then recommend strategies, informed by our own work and that of others, to reduce the impact of these biases in future research.

Recent findings: Systematic error, or bias, is a concern in all observational research, including EHR-based studies of CVD risk surveillance. Patients captured in an EHR system may not be representative of the general population due to issues such as informed presence bias, perceptions about the healthcare system that influence entry, and access to health services. Further, the EHR may contain inaccurate information or be missing key data points of interest due to loss to follow-up or over-diagnosis bias. Several strategies, including implementation of unique patient identifiers, adoption of standardized rules for inclusion/exclusion criteria, statistical procedures for data harmonization and analysis, and incorporation of patient-reported data, have been used to reduce the impact of these biases.

Summary: EHR data provide an opportunity to monitor and characterize CVD risk in populations. However, understanding the biases that arise from EHR datasets is instrumental in planning epidemiological studies and interpreting study findings. Strategies to reduce the impact of bias in the context of EHR data can increase the quality and utility of these data.

Keywords: bias; cardiovascular disease; electronic health record; epidemiology; risk factors.

Grants and funding

UM1 CA173642/CA/NCI NIH HHS/United States