Use of instrumental variables in electronic health record-driven models

Stat Methods Med Res. 2018 Feb;27(2):608-621. doi: 10.1177/0962280216641154. Epub 2016 Apr 7.

Abstract

Precision medicine raises methodological challenges whose assessment requires weighing multiple factors. In particular, the sheer volume and variety of data in Electronic Health Records pose interoperability issues and call for novel inference strategies. An apparent paradox is that highly specific treatments and a variety of outcomes are rarely matched by consistent observations (i.e., large samples). Because Electronic Health Records are heterogeneous, models for evaluating treatment effects must be selected with care, and in some cases the use of instrumental variables may be necessary. We studied the recently defined person-centered treatment effects in cancer and C-section contexts using Electronic Health Record sources, and identified the distance of patients from hospitals as an instrument. We first present the rationale for using this instrument and then its model implementation. For cancer patients, distance acts as a penalty, with a negative effect on the probability of receiving surgery; for C-section, the effect is instead positive, owing to a higher propensity to schedule delivery. Overall, the estimated person-centered treatment effects reveal a high degree of heterogeneity, whose interpretation remains context-dependent. In light of our two case studies, we suggest that the use of instruments requires ad hoc variable selection for both covariates and instruments, together with additional testing to ensure validity.
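
To illustrate the general idea of using distance from the hospital as an instrument, the following is a minimal sketch on simulated data, not the authors' model. It uses the basic ratio (Wald-type) instrumental-variable estimator with a constant treatment effect, which is far simpler than the local instrumental variables approach behind person-centered treatment effects. Every number, coefficient, and name in it (`simulate`, `cov`, the 50 km range, the true effect of 2.0) is a hypothetical assumption chosen for illustration; the negative sign of the distance effect merely mirrors the direction reported for the cancer case.

```python
import random


def cov(xs, ys):
    """Sample covariance of two equal-length sequences."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / (n - 1)


def simulate(n=20000, true_effect=2.0, seed=0):
    """Simulate hypothetical patients (all coefficients are assumptions).

    z: distance to hospital in km (the instrument)
    d: 1.0 if the patient receives the treatment, else 0.0
    y: outcome, affected by treatment and by an unobserved confounder
    """
    rng = random.Random(seed)
    z, d, y = [], [], []
    for _ in range(n):
        dist = rng.uniform(0.0, 50.0)
        u = rng.gauss(0.0, 1.0)  # unobserved confounder
        # Greater distance lowers the chance of treatment; u raises it.
        latent = 1.25 - 0.05 * dist + u + rng.gauss(0.0, 1.0)
        treated = 1.0 if latent > 0 else 0.0
        # Distance affects the outcome only through treatment
        # (the exclusion restriction holds by construction here).
        outcome = true_effect * treated + 1.5 * u + rng.gauss(0.0, 1.0)
        z.append(dist)
        d.append(treated)
        y.append(outcome)
    return z, d, y


z, d, y = simulate()

# Naive regression slope of outcome on treatment: biased by the confounder.
naive = cov(d, y) / cov(d, d)

# First stage: how strongly distance shifts the probability of treatment.
first_stage = cov(z, d) / cov(z, z)

# Ratio IV estimate: effect of distance on outcome divided by
# effect of distance on treatment.
iv = cov(z, y) / cov(z, d)

print(f"first-stage slope per km: {first_stage:.4f}")
print(f"naive estimate:           {naive:.3f}")
print(f"IV estimate:              {iv:.3f}")
```

In this simulation the naive estimate overstates the effect because the confounder drives both treatment and outcome, while the instrument-based ratio should land close to the assumed value of 2.0. With real Electronic Health Record data the exclusion restriction cannot be taken for granted, which is why the abstract calls for additional validity testing.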

Keywords: C-section; Precision medicine; cancer; electronic health record; local instrumental variables; person-centered treatment.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biostatistics
  • Cesarean Section / statistics & numerical data
  • Electronic Health Records / statistics & numerical data*
  • Female
  • Humans
  • Male
  • Models, Statistical
  • Precision Medicine / statistics & numerical data
  • Pregnancy
  • Prostatic Neoplasms / therapy
  • Regression Analysis