Computational Assessment of the Expression-modulating Potential for Non-coding Variants

Genomics Proteomics Bioinformatics. 2023 Jun;21(3):662-673. doi: 10.1016/j.gpb.2021.10.003. Epub 2021 Dec 7.

Abstract

Large-scale genome-wide association studies (GWAS) and expression quantitative trait locus (eQTL) studies have identified multiple non-coding variants that are associated with genetic diseases through their effects on gene expression. However, pinpointing causal variants effectively and efficiently remains a serious challenge. Here, we developed CARMEN, a novel algorithm to identify functional non-coding expression-modulating variants. Multiple evaluations demonstrated CARMEN's superior performance over state-of-the-art tools. Applying CARMEN to GWAS and eQTL datasets further pinpointed several causal variants other than the reported lead single-nucleotide polymorphisms (SNPs). CARMEN scales well to massive datasets and is available online as a web server at http://carmen.gao-lab.org.

Keywords: Algorithm; Expression-modulating variant; Gene regulation; Non-coding variant; Web server.

MeSH terms

  • Algorithms
  • Genetic Predisposition to Disease
  • Genome-Wide Association Study*
  • Humans
  • Polymorphism, Single Nucleotide
  • Quantitative Trait Loci*