Next generation modeling in GWAS: comparing different genetic architectures

Hum Genet. 2014 Oct;133(10):1235-53. doi: 10.1007/s00439-014-1461-1. Epub 2014 Jun 17.

Abstract

The continuous advancement in genotyping technology has not been accompanied by the application of innovative statistical methods, such as multi-marker methods (MMM), to unravel genetic associations with complex traits. Although the performance of MMM has been widely explored in a prediction context, little is known about their behavior in quantitative trait loci (QTL) detection under complex genetic architectures. We shed light on this open question by applying Bayes A (BA) and Bayesian LASSO (BL) to simulated and real data. Both methods were compared with single marker regression (SMR). Simulated data were generated under six scenarios differing in effect size, minor allele frequency (MAF) and linkage disequilibrium (LD) between QTLs. The scenarios were based on real SNP genotypes on chromosome 21 from the Spanish Bladder Cancer Study. We show that the genetic architecture dramatically affects the behavior of the methods in terms of power, type I error and accuracy of estimates. Markers with high MAF are easier to detect by all methods, especially if they have a large effect on the phenotypic trait. High LD between QTLs affects the power of the methods differently: it impairs QTL detection with BA, irrespective of effect size, but boosts the detection of small-effect QTLs with BL and SMR. We demonstrate the advantage of applying MMM rather than SMR because of their larger power and smaller type I error. Results from applying MMM to real data suggest novel associations not detected by SMR.
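To make the SMR baseline concrete, the following is a minimal sketch of a single marker regression scan on simulated data. It is not the authors' pipeline. It assumes independent markers drawn under Hardy-Weinberg equilibrium (so it ignores the LD structure of the real chromosome 21 genotypes), a single additive QTL, a normal approximation for p-values, and a Bonferroni threshold. All names and parameter values (`simulate_genotypes`, `single_marker_regression`, the sample size, the effect of 0.5) are illustrative choices, not values from the study.

```python
import math
import numpy as np


def simulate_genotypes(n_samples, mafs, rng):
    """Draw independent 0/1/2 genotypes under Hardy-Weinberg equilibrium.

    Assumption: markers are independent, i.e. no linkage disequilibrium.
    """
    mafs = np.asarray(mafs, dtype=float)
    return rng.binomial(2, mafs, size=(n_samples, mafs.size)).astype(float)


def single_marker_regression(X, y):
    """Fit y = a + b_j * x_j + e separately for every marker column j.

    Returns the slope estimates, their t statistics, and two-sided
    p-values from a normal approximation (reasonable for large n).
    Assumes every marker is polymorphic in the sample.
    """
    n = X.shape[0]
    Xc = X - X.mean(axis=0)            # centre each marker
    yc = y - y.mean()                  # centre the phenotype
    sxx = (Xc ** 2).sum(axis=0)        # per-marker sum of squares
    beta = (Xc.T @ yc) / sxx           # least-squares slopes
    rss = (yc ** 2).sum() - beta ** 2 * sxx   # residual sum of squares
    se = np.sqrt(rss / (n - 2) / sxx)  # standard error of each slope
    t_stat = beta / se
    p_val = np.array([math.erfc(abs(t) / math.sqrt(2.0)) for t in t_stat])
    return beta, t_stat, p_val


# Illustrative scenario: 200 markers, one QTL with high MAF and a large effect.
rng = np.random.default_rng(0)
n_samples, n_markers, qtl_index, qtl_effect = 2000, 200, 10, 0.5
mafs = rng.uniform(0.05, 0.5, size=n_markers)
mafs[qtl_index] = 0.3

X = simulate_genotypes(n_samples, mafs, rng)
y = qtl_effect * X[:, qtl_index] + rng.normal(0.0, 1.0, size=n_samples)

beta, t_stat, p_val = single_marker_regression(X, y)
threshold = 0.05 / n_markers           # Bonferroni correction (an assumption)
detected = np.flatnonzero(p_val < threshold)
print("markers passing the threshold:", detected)
print("estimated QTL effect:", round(float(beta[qtl_index]), 3))
```

Lowering `mafs[qtl_index]` or `qtl_effect` in this sketch reduces the t statistic of the causal marker, which is the qualitative pattern the abstract reports for MAF and effect size. A multi-marker method such as BA or BL would instead fit all columns of `X` jointly, which this example does not attempt.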

Publication types

  • Comparative Study
  • Multicenter Study

MeSH terms

  • Alleles
  • Bayes Theorem
  • Case-Control Studies
  • Computer Simulation*
  • Gene Frequency
  • Genes, Neoplasm
  • Genome-Wide Association Study / statistics & numerical data*
  • Genotyping Techniques / methods*
  • Genotyping Techniques / statistics & numerical data
  • Hispanic or Latino / genetics
  • Hispanic or Latino / statistics & numerical data
  • Humans
  • Linkage Disequilibrium
  • Polymorphism, Single Nucleotide
  • Quantitative Trait Loci
  • Urinary Bladder Neoplasms / epidemiology
  • Urinary Bladder Neoplasms / genetics