Valid statistical inference methods for a case-control study with missing data

Stat Methods Med Res. 2018 Apr;27(4):1001-1023. doi: 10.1177/0962280216649619. Epub 2016 May 19.

Abstract

The main objective of this paper is to derive the valid sampling distribution of the observed counts in a case-control study with missing data under the assumption of missing at random by employing the conditional sampling method and the mechanism augmentation method. The proposed sampling distribution, called the case-control sampling distribution, can be used to calculate the standard errors of the maximum likelihood estimates of parameters via the Fisher information matrix and to generate independent samples for constructing small-sample bootstrap confidence intervals. Theoretical comparisons of the new case-control sampling distribution with two existing sampling distributions exhibit a large difference. Simulations are conducted to investigate the influence of the three different sampling distributions on statistical inferences. One finding is that the conclusion by the Wald test for testing independency under the two existing sampling distributions could be completely different (even contradictory) from the Wald test for testing the equality of the success probabilities in control/case groups under the proposed distribution. A real cervical cancer data set is used to illustrate the proposed statistical methods.

Keywords: Bootstrap methods; Wald test; case–control study; missing at random; the mechanism augmentation method.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bias*
  • Case-Control Studies
  • Likelihood Functions
  • Models, Statistical
  • Vital Statistics*