sumSTAAR: A flexible framework for gene-based association studies using GWAS summary statistics

PLoS Comput Biol. 2022 Jun 2;18(6):e1010172. doi: 10.1371/journal.pcbi.1010172. eCollection 2022 Jun.

Abstract

Gene-based association analysis is an effective gene-mapping tool. Many gene-based methods have been proposed recently. However, their power depends on the underlying genetic architecture, which is rarely known in complex traits, and so it is likely that a combination of such methods could serve as a universal approach. Several frameworks combining different gene-based methods have been developed. However, they all imply a fixed set of methods, weights and functional annotations. Moreover, most of them use individual phenotypes and genotypes as input data. Here, we introduce sumSTAAR, a framework for gene-based association analysis using summary statistics obtained from genome-wide association studies (GWAS). It is an extended and modified version of STAAR framework proposed by Li and colleagues in 2020. The sumSTAAR framework offers a wider range of gene-based methods to combine. It allows the user to arbitrarily define a set of these methods, weighting functions and probabilities of genetic variants being causal. The methods used in the framework were adapted to analyse genes with large number of SNPs to decrease the running time. The framework includes the polygene pruning procedure to guard against the influence of the strong GWAS signals outside the gene. We also present new improved matrices of correlations between the genotypes of variants within genes. These matrices estimated on a sample of 265,000 individuals are a state-of-the-art replacement of widely used matrices based on the 1000 Genomes Project data.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Genetic Association Studies
  • Genome-Wide Association Study* / methods
  • Phenotype
  • Polymorphism, Single Nucleotide / genetics
  • Quantitative Trait Loci*

Grants and funding

NMB GRS AVK IVZ YAT TIA received the funding from a budget project of the Institute of Cytology and Genetics (project number FWNR-2022-0020). NMB GRS AVK IVZ TIA received funding from the Russian Foundation for Basic Research (20-04-00464, https://www.rfbr.ru) YAT received the funding from the program "5-100 Best Universities" of the Ministry of Science and Higher Education of the Russian Federation (https://www.5top100.ru/) The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.