Limitations in Using Multiple Imputation to Harmonize Individual Participant Data for Meta-Analysis

Prev Sci. 2018 Feb;19(Suppl 1):95-108. doi: 10.1007/s11121-017-0760-x.

Abstract

Individual participant data (IPD) meta-analysis is a meta-analysis in which the individual-level data for each study are obtained and used for synthesis. A common challenge in IPD meta-analysis is when variables of interest are measured differently in different studies. The term harmonization has been coined to describe the procedure of placing variables on the same scale in order to permit pooling of data from a large number of studies. Using data from an IPD meta-analysis of 19 adolescent depression trials, we describe a multiple imputation approach for harmonizing 10 depression measures across the 19 trials by treating those depression measures that were not used in a study as missing data. We then apply diagnostics to address the fit of our imputation model. Even after reducing the scale of our application, we were still unable to produce accurate imputations of the missing values. We describe those features of the data that made it difficult to harmonize the depression measures and provide some guidelines for using multiple imputation for harmonization in IPD meta-analysis.

Keywords: Data synthesis; Individual participant data meta-analysis; Multiple imputation; Posterior predictive checking.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Adolescent
  • Bias*
  • Big Data
  • Child
  • Data Analysis*
  • Depression
  • Female
  • Humans
  • Male
  • Meta-Analysis as Topic*
  • Research Subjects* / statistics & numerical data