Calculating the most likely intron splicing orders in S. pombe, fruit fly, Arabidopsis thaliana, and humans

BMC Bioinformatics. 2020 Oct 24;21(1):478. doi: 10.1186/s12859-020-03818-6.

Abstract

Background: Introns have been shown to be spliced in a defined order, and this order influences both alternative splicing regulation and splicing fidelity, but previous studies have only considered neighbouring introns. The detailed intron splicing order remains unknown.

Results: In this work, a method was developed that can calculate the intron splicing orders of all introns in each transcript. A simulation study showed that this method can accurately calculate intron splicing orders. I further applied this method to real S. pombe, fruit fly, Arabidopsis thaliana, and human sequencing datasets and found that intron splicing orders change from gene to gene and that humans contain more not in-order spliced transcripts than S. pombe, fruit fly and Arabidopsis thaliana. In addition, I reconfirmed that the first introns in humans are spliced slower than those in S. pombe, fruit fly, and Arabidopsis thaliana genome-widely. Both the calculated most likely orders and the method developed here are available on the web.

Conclusions: A novel computational method was developed to calculate the intron splicing orders and applied the method to real sequencing datasets. I obtained intron splicing orders for hundreds or thousands of genes in four organisms. I found humans contain more number of not in-order spliced transcripts.

Keywords: Bayesian network; Intron splicing order; Most likely order; Splicing.

MeSH terms

  • Alternative Splicing
  • Animals
  • Arabidopsis / genetics*
  • Base Sequence
  • Computational Biology / methods*
  • Drosophila melanogaster / genetics*
  • Humans
  • Introns / genetics*
  • RNA Splicing / genetics*
  • Schizosaccharomyces / genetics*