Using canonical correlation analysis to discover genetic regulatory variants

PLoS One. 2010 May 13;5(5):e10395. doi: 10.1371/journal.pone.0010395.

Abstract

Background: Discovering genetic associations between genetic markers and gene expression levels can provide insight into gene regulation and, potentially, mechanisms of disease. Such analyses typically involve a linkage or association analysis in which expression data are used as phenotypes. This approach leads to a large number of multiple comparisons and may therefore lack power. We assess the potential of applying canonical correlation analysis to partitioned genomewide data as a method for discovering regulatory variants.

Methodology/principal findings: Simulations suggest that canonical correlation analysis has higher power than standard pairwise univariate regression to detect single nucleotide polymorphisms when the expression trait has low heritability. The increase in power is even greater under the recessive model. We demonstrate this approach using the Childhood Asthma Management Program data.

Conclusions/significance: Our approach reduces multiple comparisons and may provide insight into the complex relationships between genotype and gene expression.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Asthma / genetics
  • Child
  • Computer Simulation
  • Gene Regulatory Networks / genetics*
  • Genetic Association Studies*
  • Humans
  • Models, Statistical*
  • Polymorphism, Single Nucleotide / genetics
  • Statistics as Topic