Weighted mean difference statistics for paired data in the presence of missing values

Stat Methods Med Res. 2023 Oct;32(10):2033-2048. doi: 10.1177/09622802231192947. Epub 2023 Aug 30.

Abstract

Missing data is a common issue in many biomedical studies. Under a paired design, some subjects may have missing values under one or both conditions because of loss to follow-up, insufficient biological samples, etc. Such partially paired data complicate statistical comparison of the distribution of the variable of interest between the two conditions. In this article, we propose a general class of test statistics based on the difference in weighted sample means, without imposing any distributional or model assumptions. An optimal weight is derived within this class of tests. Simulation studies show that our proposed test with the optimal weight performs well and outperforms existing methods in practical situations. Two cancer biomarker studies are provided for illustration.
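As a rough illustration of what a difference in weighted sample means can look like for partially paired data, the Python sketch below combines the complete pairs with the subjects observed under only one condition. It is not the statistic or the optimal weight derived in the article, which the abstract does not specify. The function name, the default weights (proportional to sample size), the variance formula (which assumes values are missing completely at random and that the groups are independent), and the large-sample normal reference distribution are all assumptions made for this example.

```python
import math
from statistics import mean, variance


def weighted_mean_difference_test(x_pair, y_pair, x_only, y_only,
                                  w_x=None, w_y=None):
    """Hypothetical weighted mean difference test for partially paired data.

    x_pair, y_pair: values for subjects observed under both conditions.
    x_only, y_only: values for subjects observed under one condition only.
    w_x, w_y: weights on the paired-sample means (assumed defaults below).
    Returns (difference, z statistic, two-sided normal p-value).
    """
    n, n_x, n_y = len(x_pair), len(x_only), len(y_only)
    if n != len(y_pair):
        raise ValueError("x_pair and y_pair must have the same length")
    if min(n, n_x, n_y) < 2:
        raise ValueError("each group needs at least two observations")

    # Assumed default: weight each mean by its share of the observations,
    # which reproduces the pooled mean of all values for that condition.
    if w_x is None:
        w_x = n / (n + n_x)
    if w_y is None:
        w_y = n / (n + n_y)

    mx, my = mean(x_pair), mean(y_pair)
    # Sample covariance within the complete pairs.
    cov_xy = sum((a - mx) * (b - my) for a, b in zip(x_pair, y_pair)) / (n - 1)

    # Difference of the two weighted sample means.
    diff = ((w_x * mx + (1 - w_x) * mean(x_only))
            - (w_y * my + (1 - w_y) * mean(y_only)))

    # Variance estimate: the paired part keeps the covariance term,
    # the two unpaired groups are treated as independent of everything else.
    var = ((w_x ** 2 * variance(x_pair) + w_y ** 2 * variance(y_pair)
            - 2 * w_x * w_y * cov_xy) / n
           + (1 - w_x) ** 2 * variance(x_only) / n_x
           + (1 - w_y) ** 2 * variance(y_only) / n_y)
    if var <= 0:
        raise ValueError("estimated variance is not positive")

    z = diff / math.sqrt(var)
    p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal p-value
    return diff, z, p_value
```

Setting both weights to 1 discards the unpaired observations and reduces the sketch to a large-sample paired z-test, while setting both to 0 uses only the unpaired observations, as in a two-sample comparison. Intermediate weights trade off these two sources of information.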

Keywords: Paired data; biomarker data; mean difference tests; missing data; nonparametric tests.