Nonlinear dimension reduction with Wright-Fisher kernel for genotype aggregation and association mapping

Bioinformatics. 2012 Sep 15;28(18):i375-i381. doi: 10.1093/bioinformatics/bts406.

Abstract

Motivation: Association tests based on next-generation sequencing data are often under-powered due to the presence of rare variants and large amount of neutral or protective variants. A successful strategy is to aggregate genetic information within meaningful single-nucleotide polymorphism (SNP) sets, e.g. genes or pathways, and test association on SNP sets. Many existing methods for group-wise tests require specific assumptions about the direction of individual SNP effects and/or perform poorly in the presence of interactions.

Results: We propose a joint association test strategy based on two key components: a nonlinear supervised dimension reduction approach for effective SNP information aggregation and a novel kernel specially designed for qualitative genotype data. The new test demonstrates superior performance in identifying causal genes over existing methods across a large variety of disease models simulated from sequence data of real genes. In general, the proposed method provides an association test strategy that can (i) detect both rare and common causal variants, (ii) deal with both additive and interaction effect, (iii) handle both quantitative traits and disease dichotomies and (iv) incorporate non-genetic covariates. In addition, the new kernel can potentially boost the power of the entire family of kernel-based methods for genetic data analysis.

Availability: The method is implemented in MATLAB. Source code is available upon request.

Contact: hongjie.zhu@duke.edu.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms*
  • Genetic Association Studies*
  • Genotype
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Markov Chains
  • Polymorphism, Single Nucleotide*