Binary variable multiple-model multiple imputation to address missing data mechanism uncertainty: application to a smoking cessation trial

Stat Med. 2014 Jul 30;33(17):3013-28. doi: 10.1002/sim.6137. Epub 2014 Mar 17.

Abstract

The true missing data mechanism is never known in practice. We present a method for generating multiple imputations for binary variables, which formally incorporates missing data mechanism uncertainty. Imputations are generated from a distribution of imputation models rather than a single model, with the distribution reflecting subjective notions of missing data mechanism uncertainty. Parameter estimates and standard errors are obtained using rules for nested multiple imputation. Using simulation, we investigate the impact of missing data mechanism uncertainty on post-imputation inferences and show that incorporating this uncertainty can increase the coverage of parameter estimates. We apply our method to a longitudinal smoking cessation trial where nonignorably missing data were a concern. Our method provides a simple approach for formalizing subjective notions regarding nonresponse and can be implemented using existing imputation software.

Keywords: NMAR; binary data; nonignorable; not missing at random.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Computer Simulation
  • Humans
  • Models, Statistical*
  • Smoking Cessation / methods
  • Uncertainty*