DifferentialRegulation: a Bayesian hierarchical approach to identify differentially regulated genes

bioRxiv [Preprint]. 2023 Aug 17:2023.08.17.553679. doi: 10.1101/2023.08.17.553679.

Abstract

Motivation: Although transcriptomics data is typically used to analyse mature spliced mRNA, recent attention has focused on jointly investigating spliced and unspliced (or precursor-) mRNA, which can be used to study gene regulation and changes in gene expression production. Nonetheless, most methods for spliced/unspliced inference (such as RNA velocity tools) focus on individual samples, and rarely allow comparisons between groups of samples (e.g., healthy vs. diseased). Furthermore, this kind of inference is challenging, because spliced and unspliced mRNA abundance is characterized by a high degree of quantification uncertainty, due to the prevalence of multi-mapping reads, i.e., reads compatible with multiple transcripts (or genes), and/or with both their spliced and unspliced versions.

Results: Here, we present DifferentialRegulation, a Bayesian hierarchical method to discover changes between experimental conditions with respect to the relative abundance of unspliced mRNA (over the total mRNA). We model the quantification uncertainty via a latent variable approach, where reads are allocated to their gene/transcript of origin, and to the respective splice version. We designed several benchmarks where our approach shows good performance, in terms of sensitivity and error control, versus state-of-the-art competitors. Importantly, our tool is flexible, and works with both bulk and single-cell RNA-sequencing data.

Availability and implementation: DifferentialRegulation is distributed as a Bioconductor R package.

Keywords: Bayesian hierarchical model; Gene expression; Gene regulation; Latent states; RNA-sequencing; Splicing; Statistical software tool; Transcriptomics.

Publication types

  • Preprint