Rare variant association testing for multicategory phenotype

Genet Epidemiol. 2019 Sep;43(6):646-656. doi: 10.1002/gepi.22210. Epub 2019 May 13.

Abstract

Genetic association studies have provided new insights into the genetic variability of human complex traits with a focus mainly on continuous or binary traits. Methods have been proposed to take into account disease heterogeneity between subgroups of patients when studying common variants but none was specifically designed for rare variants. Because rare variants are expected to have stronger effects and to be more heterogeneously distributed among cases than common ones, subgroup analyses might be particularly attractive in this context. To address this issue, we propose an extension of burden tests by using a multinomial regression model, which enables association tests between rare variants and multicategory phenotypes. We evaluated the type I error and the power of two burden tests, CAST and WSS, by simulating data under different scenarios. In the case of genetic heterogeneity between case subgroups, we showed an advantage of multinomial regression over logistic regression, which considers all the cases against the controls. We replicated these results on real data from Moyamoya disease where the burden tests performed better when cases were stratified according to age-of-onset. We implemented the functions for association tests in the R package "Ravages" available on Github.

Keywords: association; burden tests; disease severity; rare variant; subphenotypes.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Age of Onset
  • Case-Control Studies
  • Cerebrovascular Disorders / genetics*
  • Computer Simulation / standards*
  • Data Interpretation, Statistical
  • Genetic Association Studies*
  • Genetic Variation*
  • Humans
  • Logistic Models
  • Models, Genetic*
  • Moyamoya Disease / genetics*
  • Multifactorial Inheritance / genetics*
  • Phenotype
  • Prognosis
  • Severity of Illness Index