Models with interactions overestimated heterogeneity of treatment effects and were prone to treatment mistargeting

J Clin Epidemiol. 2019 Oct:114:72-83. doi: 10.1016/j.jclinepi.2019.05.029. Epub 2019 Jun 10.

Abstract

Objectives: We aimed to compare the performance of different regression modeling approaches for the prediction of heterogeneous treatment effects.

Study design and setting: We simulated trial samples (n = 3,600; 80% power for a treatment odds ratio of 0.8) from a superpopulation (N = 1,000,000) with 12 binary risk predictors, both without and with six true treatment interactions. We assessed predictions of treatment benefit for four regression models: a "risk model" (with a constant effect of treatment assignment) and three "effect models" (including interactions of risk predictors with treatment assignment). Three novel performance measures were evaluated: calibration for benefit (i.e., observed vs. predicted risk difference in treated vs. untreated), discrimination for benefit, and prediction error for benefit.

Results: The risk modeling approach was well-calibrated for benefit, whereas effect models were consistently overfit, even with doubled sample sizes. Penalized regression reduced miscalibration of the effect models considerably. In terms of discrimination and prediction error, the risk modeling approach was superior in the absence of true treatment effect interactions, whereas penalized regression was optimal in the presence of true treatment interactions.

Conclusion: A risk modeling approach yields models consistently well calibrated for benefit. Effect modeling may improve discrimination for benefit in the presence of true interactions but is prone to overfitting. Hence, effect models-including only plausible interactions-should be fitted using penalized regression.

Keywords: Heterogeneity of treatment effect; Penalized regression analysis; Personalized medicine; Prediction models; Regression analysis; Treatment benefit.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Calibration
  • Coronary Artery Bypass / mortality
  • Coronary Artery Disease / surgery
  • Humans
  • Models, Statistical*
  • Odds Ratio
  • Percutaneous Coronary Intervention / mortality
  • Precision Medicine / statistics & numerical data
  • Randomized Controlled Trials as Topic / statistics & numerical data*
  • Regression Analysis*
  • Risk Assessment
  • Risk Factors
  • Sample Size
  • Simulation Training
  • Treatment Outcome*