Genome-wide association filtering using a highly locus-specific transmission/disequilibrium test

Hum Genet. 2010 Sep;128(3):325-44. doi: 10.1007/s00439-010-0854-z. Epub 2010 Jul 6.

Abstract

Multimarker transmission/disequilibrium tests (TDTs) are powerful association and linkage tests used to perform genome-wide filtering in the search for disease susceptibility loci. In contrast to case/control studies, they have a low rate of false positives for population stratification and admixture. However, the length of a region found in association with a disease is usually very large because of linkage disequilibrium (LD). Here, we define a multimarker proportional TDT (mTDT ( P )) designed to improve locus specificity in complex diseases that has good power compared to the most powerful multimarker TDTs. The test is a simple generalization of a multimarker TDT in which haplotype frequencies are used to weight the effect that each haplotype has on the whole measure. Two concepts underlie the features of the metric: the 'common disease, common variant' hypothesis and the decrease in LD with chromosomal distance. Because of this decrease, the frequency of haplotypes in strong LD with common disease variants decreases with increasing distance from the disease susceptibility locus. Thus, our haplotype proportional test has higher locus specificity than common multimarker TDTs that assume a uniform distribution of haplotype probabilities. Because of the common variant hypothesis, risk haplotypes at a given locus are relatively frequent and a metric that weights partial results for each haplotype by its frequency will be as powerful as the most powerful multimarker TDTs. Simulations and real data sets demonstrate that the test has good power compared with the best tests but has remarkably higher locus specificity, so that the association rate decreases at a higher rate with distance from a disease susceptibility or disease protective locus.

MeSH terms

  • Biostatistics
  • Databases, Genetic
  • Female
  • Genetic Markers
  • Genetic Predisposition to Disease
  • Genetics, Population
  • Genome-Wide Association Study / methods*
  • Genome-Wide Association Study / statistics & numerical data
  • Haplotypes
  • Humans
  • Linkage Disequilibrium*
  • Male
  • Models, Genetic
  • Oligonucleotide Array Sequence Analysis / statistics & numerical data
  • Polymorphism, Single Nucleotide

Substances

  • Genetic Markers