A primer on quantitative bias analysis with positive predictive values in research using electronic health data

J Am Med Inform Assoc. 2019 Dec 1;26(12):1664-1674. doi: 10.1093/jamia/ocz094.

Abstract

Objective: In health informatics, there have been concerns with reuse of electronic health data for research, including potential bias from incorrect or incomplete outcome ascertainment. In this tutorial, we provide a concise review of predictive value-based quantitative bias analysis (QBA), which comprises epidemiologic methods that use estimates of data quality accuracy to quantify the bias caused by outcome misclassification.

Target audience: Health informaticians and investigators reusing large, electronic health data sources for research.

Scope: When electronic health data are reused for research, validation of outcome case definitions is recommended, and positive predictive values (PPVs) are the most commonly reported measure. Typically, case definitions with high PPVs are considered to be appropriate for use in research. However, in some studies, even small amounts of misclassification can cause bias. In this tutorial, we introduce methods for quantifying this bias that use predictive values as inputs. Using epidemiologic principles and examples, we first describe how multiple factors influence misclassification bias, including outcome misclassification levels, outcome prevalence, and whether outcome misclassification levels are the same or different by exposure. We then review 2 predictive value-based QBA methods and why outcome PPVs should be stratified by exposure for bias assessment. Using simulations, we apply and evaluate the methods in hypothetical electronic health record-based immunization schedule safety studies. By providing an overview of predictive value-based QBA, we hope to bridge the disciplines of health informatics and epidemiology to inform how the impact of data quality issues can be quantified in research using electronic health data sources.

Keywords: bias; electronic health records; medical informatics; outcome assessment.

Publication types

  • Research Support, N.I.H., Extramural
  • Review

MeSH terms

  • Bias*
  • Electronic Health Records*
  • Epidemiologic Methods
  • Humans
  • Medical Informatics
  • Outcome Assessment, Health Care*
  • Predictive Value of Tests
  • Public Health Surveillance
  • Reproducibility of Results
  • Sensitivity and Specificity