Valid statistical inference methods for a case-control study with missing data

Guo-Liang Tian; Chi Zhang; Xuejun Jiang

doi:10.1177/0962280216649619

Valid statistical inference methods for a case-control study with missing data

Stat Methods Med Res. 2018 Apr;27(4):1001-1023. doi: 10.1177/0962280216649619. Epub 2016 May 19.

Authors

Guo-Liang Tian¹, Chi Zhang², Xuejun Jiang¹

Affiliations

¹ 1 Department of Mathematics, South University of Science and Technology of China, Shenzhen City, Guangdong, P. R. China.
² 2 Department of Statistics and Actuarial Science, The University of Hong Kong, Pokfulam Road, Hong Kong, P. R. China.

PMID: 27199234
DOI: 10.1177/0962280216649619

Abstract

The main objective of this paper is to derive the valid sampling distribution of the observed counts in a case-control study with missing data under the assumption of missing at random by employing the conditional sampling method and the mechanism augmentation method. The proposed sampling distribution, called the case-control sampling distribution, can be used to calculate the standard errors of the maximum likelihood estimates of parameters via the Fisher information matrix and to generate independent samples for constructing small-sample bootstrap confidence intervals. Theoretical comparisons of the new case-control sampling distribution with two existing sampling distributions exhibit a large difference. Simulations are conducted to investigate the influence of the three different sampling distributions on statistical inferences. One finding is that the conclusion by the Wald test for testing independency under the two existing sampling distributions could be completely different (even contradictory) from the Wald test for testing the equality of the success probabilities in control/case groups under the proposed distribution. A real cervical cancer data set is used to illustrate the proposed statistical methods.

Keywords: Bootstrap methods; Wald test; case–control study; missing at random; the mechanism augmentation method.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Bias*
Case-Control Studies
Likelihood Functions
Models, Statistical
Vital Statistics*