Testing for genetic associations in arbitrarily structured populations

Nat Genet. 2015 May;47(5):550-4. doi: 10.1038/ng.3244. Epub 2015 Mar 30.

Abstract

We present a new statistical test of association between a trait and genetic markers, which we theoretically and practically prove to be robust to arbitrarily complex population structure. The statistical test involves a set of parameters that can be directly estimated from large-scale genotyping data, such as those measured in genome-wide association studies (GWAS). We also derive a new set of methodologies, called a 'genotype-conditional association test' (GCAT), shown to provide accurate association tests in populations with complex structures, manifested in both the genetic and non-genetic contributions to the trait. We demonstrate the proposed method on a large simulation study and on the Northern Finland Birth Cohort study. In the Finland study, we identify several new significant loci that other methods do not detect. Our proposed framework provides a substantially different approach to the problem from existing methods, such as the linear mixed-model and principal-component approaches.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Computer Simulation
  • Genome-Wide Association Study*
  • Humans
  • Linear Models
  • Models, Genetic
  • Polymorphism, Single Nucleotide
  • Principal Component Analysis

Associated data

  • dbGaP/PHS000276.V2.P1