Learning About Missing Data Mechanisms in Electronic Health Records-based Research: A Survey-based Approach

Epidemiology. 2016 Jan;27(1):82-90. doi: 10.1097/EDE.0000000000000393.

Abstract

Background: Bias due to missing data is a major concern in electronic health record (EHR)-based research. As part of an ongoing EHR-based study of weight change among patients treated for depression, we conducted a survey to investigate determinants of missingness in the available weight information and to evaluate the missing-at-random assumption.

Methods: We identified 8,345 individuals enrolled in a large EHR-based health care system who had monotherapy treatment for depression from April 2008 to March 2010. A stratified sample of 1,153 individuals completed a detailed survey. Logistic regression was used to investigate determinants of whether a patient (1) had an opportunity to be weighed at treatment initiation (baseline), and (2) had a weight measurement recorded. Parallel analyses were conducted to investigate missingness during follow-up. Throughout, inverse-probability weighting was used to adjust for the design and survey nonresponse. Analyses were also conducted to investigate potential recall bias.

Results: Missingness at baseline and during follow-up was associated with numerous factors not routinely collected in the EHR including whether or not the patient had ever chosen not to be weighed, external weight control activities, and self-reported baseline weight. Patient attitudes about their weight and perceptions regarding the potential impact of their depression treatment on weight were not related to missingness.

Conclusion: Adopting a comprehensive strategy to investigate missingness early in the research process gives researchers information necessary to evaluate key assumptions. While the survey presented focuses on outcome data, the overarching strategy can be applied to any and all data elements subject to missingness.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Adolescent
  • Adult
  • Aged
  • Antidepressive Agents / adverse effects*
  • Antidepressive Agents / therapeutic use
  • Bias
  • Depression / drug therapy*
  • Electronic Health Records*
  • Epidemiologic Research Design*
  • Female
  • Health Care Surveys
  • Humans
  • Logistic Models
  • Male
  • Middle Aged
  • Retrospective Studies
  • Weight Gain / drug effects*
  • Weight Loss / drug effects*
  • Young Adult

Substances

  • Antidepressive Agents