ReQTL: identifying correlations between expressed SNVs and gene expression using RNA-sequencing data

Bioinformatics. 2020 Mar 1;36(5):1351-1359. doi: 10.1093/bioinformatics/btz750.

Abstract

Motivation: By testing for associations between DNA genotypes and gene expression levels, expression quantitative trait locus (eQTL) analyses have been instrumental in understanding how thousands of single nucleotide variants (SNVs) may affect gene expression. As compared to DNA genotypes, RNA genetic variation represents a phenotypic trait that reflects the actual allele content of the studied system. RNA genetic variation at expressed SNV loci can be estimated using the proportion of alleles bearing the variant nucleotide (variant allele fraction, VAFRNA). VAFRNA is a continuous measure which allows for precise allele quantitation in loci where the RNA alleles do not scale with the genotype count. We describe a method to correlate VAFRNA with gene expression and assess its ability to identify genetically regulated expression solely from RNA-sequencing (RNA-seq) datasets.

Results: We introduce ReQTL, an eQTL modification which substitutes the DNA allele count for the variant allele fraction at expressed SNV loci in the transcriptome (VAFRNA). We exemplify the method on sets of RNA-seq data from human tissues obtained though the Genotype-Tissue Expression (GTEx) project and demonstrate that ReQTL analyses are computationally feasible and can identify a subset of expressed eQTL loci.

Availability and implementation: A toolkit to perform ReQTL analyses is available at https://github.com/HorvathLab/ReQTL.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Humans
  • Nucleotides
  • Polymorphism, Single Nucleotide
  • Quantitative Trait Loci
  • RNA*
  • Sequence Analysis, RNA
  • Software*

Substances

  • Nucleotides
  • RNA