Rare variant association test with multiple phenotypes

Genet Epidemiol. 2017 Apr;41(3):198-209. doi: 10.1002/gepi.22021. Epub 2016 Dec 31.

Abstract

Although genome-wide association studies (GWAS) have now discovered thousands of genetic variants associated with common traits, such variants cannot explain the large degree of "missing heritability," likely due to rare variants. The advent of next generation sequencing technology has allowed rare variant detection and association with common traits, often by investigating specific genomic regions for rare variant effects on a trait. Although multiple correlated phenotypes are often concurrently observed in GWAS, most studies analyze only single phenotypes, which may lessen statistical power. To increase power, multivariate analyses, which consider correlations between multiple phenotypes, can be used. However, few existing multivariant analyses can identify rare variants for assessing multiple phenotypes. Here, we propose Multivariate Association Analysis using Score Statistics (MAAUSS), to identify rare variants associated with multiple phenotypes, based on the widely used sequence kernel association test (SKAT) for a single phenotype. We applied MAAUSS to whole exome sequencing (WES) data from a Korean population of 1,058 subjects to discover genes associated with multiple traits of liver function. We then assessed validation of those genes by a replication study, using an independent dataset of 3,445 individuals. Notably, we detected the gene ZNF620 among five significant genes. We then performed a simulation study to compare MAAUSS's performance with existing methods. Overall, MAAUSS successfully conserved type 1 error rates and in many cases had a higher power than the existing methods. This study illustrates a feasible and straightforward approach for identifying rare variants correlated with multiple phenotypes, with likely relevance to missing heritability.

Keywords: SKAT; association test; exome sequencing data; multivariate analysis; rare variants.

MeSH terms

  • Genetic Predisposition to Disease*
  • Genetic Variation / genetics*
  • Genome-Wide Association Study*
  • High-Throughput Nucleotide Sequencing*
  • Humans
  • Liver Diseases / epidemiology
  • Liver Diseases / genetics*
  • Models, Genetic
  • Multivariate Analysis
  • Phenotype*
  • Republic of Korea / epidemiology