Assessing Alternative Imputation Strategies for Infrequently Missing Items on Multi-item Scales

Panteha Hayati Rezvan; W Scott Comulada; M Isabel Fernández; Thomas R Belin

doi:10.1080/23737484.2022.2115430

Assessing Alternative Imputation Strategies for Infrequently Missing Items on Multi-item Scales

Commun Stat Case Stud Data Anal Appl. 2022;8(4):682-713. doi: 10.1080/23737484.2022.2115430. Epub 2022 Sep 1.

Authors

Panteha Hayati Rezvan¹, W Scott Comulada^{1

2}, M Isabel Fernández³, Thomas R Belin^{1

4}

Affiliations

¹ Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, U.S.A.
² Department of Health Policy and Management, UCLA Fielding School of Public Health, Los Angeles, California, U.S.A.
³ College of Osteopathic Medicine, Nova Southeastern University, Miami, Florida, U.S.A.
⁴ Department of Biostatistics, UCLA Fielding School of Public Health, Los Angeles, California, U.S.A.

Abstract

Health-science researchers often measure psychological constructs using multi-item scales and encounter missing items on some participants. Multiple imputation (MI) has emerged as an alternative to ad-hoc methods (e.g., mean substitution) for handling incomplete data on multi-item scales, appealingly reflecting available information while accounting for uncertainty due to missing values in a unified inferential framework. However, MI can be implemented in a variety of ways. When the number of variables to impute gets large, some strategies yield unstable estimates of quantities of interest while others are not technically feasible to implement. These considerations raise pragmatic questions about the extent to which ad-hoc procedures would yield statistical properties that are competitive with theoretically motivated methods. Drawing on an HIV study where depression and anxiety symptoms are measured with multi-item scales, this empirical investigation contrasts ad-hoc methods for handling missing items with various MI implementations that differ as to whether imputation is at the item-level or scale-level and how auxiliary variables are incorporated. While the findings are consistent with previous reports favoring item-level imputation when feasible to implement, we found only subtle differences in statistical properties across procedures, suggesting that weaknesses of ad-hoc procedures may be muted when missing data percentages are modest.

Keywords: Missing data; Multi-item scale; Multiple imputation.

Abstract

Grants and funding