Estimating inverse probability weights using super learner when weight-model specification is unknown in a marginal structural Cox model context

Stat Med. 2017 Jun 15;36(13):2032-2047. doi: 10.1002/sim.7266. Epub 2017 Feb 20.

Abstract

Correct specification of the inverse probability weighting (IPW) model is necessary for consistent inference from a marginal structural Cox model (MSCM). In practical applications, researchers are typically unaware of the true specification of the weight model. Nonetheless, IPWs are commonly estimated using parametric models, such as the main-effects logistic regression model. In practice, assumptions underlying such models may not hold and data-adaptive statistical learning methods may provide an alternative. Many candidate statistical learning approaches are available in the literature. However, the optimal approach for a given dataset is impossible to predict. Super learner (SL) has been proposed as a tool for selecting an optimal learner from a set of candidates using cross-validation. In this study, we evaluate the usefulness of a SL in estimating IPW in four different MSCM simulation scenarios, in which we varied the specification of the true weight model specification (linear and/or additive). Our simulations show that, in the presence of weight model misspecification, with a rich and diverse set of candidate algorithms, SL can generally offer a better alternative to the commonly used statistical learning approaches in terms of MSE as well as the coverage probabilities of the estimated effect in an MSCM. The findings from the simulation studies guided the application of the MSCM in a multiple sclerosis cohort from British Columbia, Canada (1995-2008), to estimate the impact of beta-interferon treatment in delaying disability progression. Copyright © 2017 John Wiley & Sons, Ltd.

Keywords: inverse-probability weighting; marginal structural models; multiple sclerosis; super learner; time-dependent confounding.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Data Interpretation, Statistical
  • Disease Progression
  • Humans
  • Interferon-beta / therapeutic use
  • Linear Models
  • Logistic Models
  • Models, Statistical*
  • Multiple Sclerosis / drug therapy
  • Proportional Hazards Models*
  • Treatment Outcome

Substances

  • Interferon-beta