Estimation of propensity scores using generalized additive models

Stat Med. 2008 Aug 30;27(19):3805-16. doi: 10.1002/sim.3278.

Abstract

Propensity score matching is often used in observational studies to create treatment and control groups with similar distributions of observed covariates. Typically, propensity scores are estimated using logistic regressions that assume linearity between the logistic link and the predictors. We evaluate the use of generalized additive models (GAMs) for estimating propensity scores. We compare logistic regressions and GAMs in terms of balancing covariates using simulation studies with artificial and genuine data. We find that, when the distributions of covariates in the treatment and control groups overlap sufficiently, using GAMs can improve overall covariate balance, especially for higher-order moments of distributions. When the distributions in the two groups overlap insufficiently, GAM more clearly reveals this fact than logistic regression does. We also demonstrate via simulation that matching with GAMs can result in larger reductions in bias when estimating treatment effects than matching with logistic regression.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Analysis of Variance
  • Computer Simulation
  • Confounding Factors, Epidemiologic
  • Humans
  • Logistic Models*
  • Observation
  • Randomized Controlled Trials as Topic / methods*
  • Treatment Outcome