Network regression analysis in transcriptome-wide association studies

BMC Genomics. 2022 Aug 6;23(1):562. doi: 10.1186/s12864-022-08809-w.

Abstract

Background: Transcriptome-wide association studies (TWASs) have shown great promise in interpreting the findings from genome-wide association studies (GWASs) and exploring the disease mechanisms, by integrating GWAS and eQTL mapping studies. Almost all TWAS methods only focus on one gene at a time, with exception of only two published multiple-gene methods nevertheless failing to account for the inter-dependence as well as the network structure among multiple genes, which may lead to power loss in TWAS analysis as complex disease often owe to multiple genes that interact with each other as a biological network. We therefore developed a Network Regression method in a two-stage TWAS framework (NeRiT) to detect whether a given network is associated with the traits of interest. NeRiT adopts the flexible Bayesian Dirichlet process regression to obtain the gene expression prediction weights in the first stage, uses pointwise mutual information to represent the general between-node correlation in the second stage and can effectively take the network structure among different gene nodes into account.

Results: Comprehensive and realistic simulations indicated NeRiT had calibrated type I error control for testing both the node effect and edge effect, and yields higher power than the existed methods, especially in testing the edge effect. The results were consistent regardless of the GWAS sample size, the gene expression prediction model in the first step of TWAS, the network structure as well as the correlation pattern among different gene nodes. Real data applications through analyzing systolic blood pressure and diastolic blood pressure from UK Biobank showed that NeRiT can simultaneously identify the trait-related nodes as well as the trait-related edges.

Conclusions: NeRiT is a powerful and efficient network regression method in TWAS.

Keywords: Biological networks; Blood pressure; Dirichlet process regression; Pointwise mutual information; TWAS.

MeSH terms

  • Bayes Theorem
  • Genetic Predisposition to Disease
  • Genome-Wide Association Study* / methods
  • Humans
  • Polymorphism, Single Nucleotide
  • Quantitative Trait Loci
  • Regression Analysis
  • Transcriptome*