A nonparametric test for association with multiple loci in the retrospective case-control study

Stat Methods Med Res. 2020 Feb;29(2):589-602. doi: 10.1177/0962280219842892. Epub 2019 Apr 16.

Abstract

The genome-wide association studies aim at identifying common or rare variants associated with common diseases and explaining more heritability. It is well known that common diseases are influenced by multiple single nucleotide polymorphisms (SNPs) that are usually correlated in location or function. In order to powerfully detect association signals, it is highly desirable to take account of correlations or linkage disequilibrium (LD) information among multiple SNPs in testing for association. In this article, we propose a test SLIDE that depicts the difference of the average multi-locus genotypes between cases and controls and derive its variance-covariance matrix in the retrospective design. This matrix is composed of the pairwise LD between SNPs. Thus SLIDE can borrow the strength from an external database in the population of interest with a few thousands to hundreds of thousands individuals to improve the power for detecting association. Extensive simulations show that SLIDE has apparent superiority over the existing methods, especially in the situation involving both common and rare variants, both protective and deleterious variants. Furthermore, the efficiency of the proposed method is demonstrated in the application to the data from the Wellcome Trust Case Control Consortium.

Keywords: Genome-wide association study; common disease; linkage disequilibrium; multiple loci; retrospective study.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Case-Control Studies
  • Disease / genetics*
  • Genetic Loci*
  • Genome-Wide Association Study* / statistics & numerical data
  • Humans
  • Polymorphism, Single Nucleotide
  • Retrospective Studies
  • Statistics, Nonparametric