The Rare-Variant Generalized Disequilibrium Test for Association Analysis of Nuclear and Extended Pedigrees with Application to Alzheimer Disease WGS Data

Am J Hum Genet. 2017 Feb 2;100(2):193-204. doi: 10.1016/j.ajhg.2016.12.001. Epub 2017 Jan 5.

Abstract

Whole-genome and exome sequence data can be cost-effectively generated for the detection of rare-variant (RV) associations in families. Causal variants that aggregate in families usually have larger effect sizes than those found in sporadic cases, so family-based designs can be a more powerful approach than population-based designs. Moreover, some family-based designs are robust to confounding due to population admixture or substructure. We developed a RV extension of the generalized disequilibrium test (GDT) to analyze sequence data obtained from nuclear and extended families. The GDT utilizes genotype differences of all discordant relative pairs to assess associations within a family, and the RV extension combines the single-variant GDT statistic over a genomic region of interest. The RV-GDT has increased power by efficiently incorporating information beyond first-degree relatives and allows for the inclusion of covariates. Using simulated genetic data, we demonstrated that the RV-GDT method has well-controlled type I error rates, even when applied to admixed populations and populations with substructure. It is more powerful than existing family-based RV association methods, particularly for the analysis of extended pedigrees and pedigrees with missing data. We analyzed whole-genome sequence data from families affected by Alzheimer disease to illustrate the application of the RV-GDT. Given the capability of the RV-GDT to adequately control for population admixture or substructure and analyze pedigrees with missing genotype data and its superior power over other family-based methods, it is an effective tool for elucidating the involvement of RVs in the etiology of complex traits.

Keywords: Alzheimer disease; TNK1; exome sequence data; extended and nuclear pedigrees; family-based association analysis; generalized disequilibrium test; missing data; pedigree disequilibrium test; rare variants; whole-genome sequence data.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Aged
  • Aged, 80 and over
  • Alzheimer Disease / diagnosis
  • Alzheimer Disease / genetics*
  • Axin Protein / genetics
  • Axin Protein / metabolism
  • Computer Simulation
  • Databases, Genetic
  • Female
  • Genetic Variation*
  • Genotype
  • Haplotypes
  • Humans
  • Linkage Disequilibrium*
  • Male
  • Models, Genetic
  • Pedigree
  • Phenotype
  • Sequence Analysis, DNA / methods*

Substances

  • AXIN1 protein, human
  • Axin Protein