SUMMIT-FA: a new resource for improved transcriptome imputation using functional annotations

Hum Mol Genet. 2024 Mar 20;33(7):624-635. doi: 10.1093/hmg/ddad205.

Abstract

Transcriptome-wide association studies (TWAS) integrate gene expression prediction models and genome-wide association studies (GWAS) to identify gene-trait associations. The power of TWAS is determined by the sample size of GWAS and the accuracy of the expression prediction model. Here, we present a new method, the Summary-level Unified Method for Modeling Integrated Transcriptome using Functional Annotations (SUMMIT-FA), which improves gene expression prediction accuracy by leveraging functional annotation resources and a large expression quantitative trait loci (eQTL) summary-level dataset. We build gene expression prediction models in whole blood using SUMMIT-FA with the comprehensive functional database MACIE and eQTL summary-level data from the eQTLGen consortium. We apply these models to GWAS for 24 complex traits and show that SUMMIT-FA identifies significantly more gene-trait associations and improves predictive power for identifying "silver standard" genes compared to several benchmark methods. We further conduct a simulation study to demonstrate the effectiveness of SUMMIT-FA.

Keywords: TWAS; eQTL Prediction; functional annotations; low-heritability Genes.

MeSH terms

  • Computer Simulation
  • Genetic Predisposition to Disease
  • Genome-Wide Association Study* / methods
  • Humans
  • Phenotype
  • Polymorphism, Single Nucleotide
  • Quantitative Trait Loci / genetics
  • Transcriptome* / genetics