Design and analysis of multiple diseases genome-wide association studies without controls

Gene. 2012 Nov 15;510(1):87-92. doi: 10.1016/j.gene.2012.07.089. Epub 2012 Aug 23.

Abstract

In genome-wide association studies (GWAS), multiple diseases with shared controls is one of the case-control study designs. If data obtained from these studies are appropriately analyzed, this design can have several advantages such as improving statistical power in detecting associations and reducing the time and cost in the data collection process. In this paper, we propose a study design for GWAS which involves multiple diseases but without controls. We also propose corresponding statistical data analysis strategy for GWAS with multiple diseases but no controls. Through a simulation study, we show that the statistical association test with the proposed study design is more powerful than the test with single disease sharing common controls, and it has comparable power to the overall test based on the whole dataset including the controls. We also apply the proposed method to a real GWAS dataset to illustrate the methodologies and the advantages of the proposed design. Some possible limitations of this study design and testing method and their solutions are also discussed. Our findings indicate that the proposed study design and statistical analysis strategy could be more efficient than the usual case-control GWAS as well as those with shared controls.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Case-Control Studies
  • Chi-Square Distribution
  • Computer Simulation
  • Control Groups
  • Genetic Predisposition to Disease / genetics*
  • Genome, Human / genetics*
  • Genome-Wide Association Study / methods*
  • Genome-Wide Association Study / statistics & numerical data
  • Humans
  • Monte Carlo Method
  • Polymorphism, Single Nucleotide*
  • Research Design