On information coded in gene-environment independence in case-control studies

Am J Epidemiol. 2011 Sep 15;174(6):736-43. doi: 10.1093/aje/kwr153. Epub 2011 Aug 9.

Abstract

For analysis of case-control genetic association studies, it has recently been shown that gene-environment independence in the population can be leveraged to increase efficiency for estimating gene-environment interaction effects in comparison with the standard prospective analysis. However, for the special case in which data on the binary phenotype and genetic and environmental risk factors can be summarized in a 2 × 2 × 2 table, the authors show here that there is no efficiency gain for estimating interaction effects, nor is there an efficiency gain for estimating the genetic and environmental main effects. This contrasts with the well-known result assuming that rare phenotype prevalence and gene-environment independence in the control population for the same data can lead to efficiency gain. This discrepancy is counterintuitive, since the 2 likelihoods are also approximately equal when the phenotype is rare. An explanation for the paradox based on a theoretical analysis is provided. Implications of these results for data analyses are also examined, and practical guidance on analyzing such case-control studies is offered.

Publication types

  • Comparative Study
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bias
  • Case-Control Studies
  • Environmental Exposure / adverse effects*
  • Environmental Illness / epidemiology
  • Environmental Illness / genetics*
  • Genetic Predisposition to Disease
  • Genotype
  • Humans
  • Models, Genetic*
  • Molecular Epidemiology / methods*
  • Research Design
  • Risk Factors