ASAPA: a bioinformatic pipeline based on Iso-Seq that identifies the links among alternative splicing, alternative transcription initiation and alternative polyadenylation

Funct Integr Genomics. 2024 Mar 26;24(2):67. doi: 10.1007/s10142-024-01332-z.

Abstract

Background: Although events associated with alternative splicing (AS), alternative polyadenylation (APA) and alternative transcription initiation (ATI) can be identified by many approaches based on isoform sequencing (Iso-Seq), these analyses are generally performed independently of one another, and the links among these events are rarely examined. However, an interdependency analysis is feasible because a single long, full-length Iso-Seq read can contain the transcription start site, the splice sites and the polyA site simultaneously.
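To illustrate why full-length reads make an interdependency analysis possible, the following is a minimal sketch, not the ASAPA implementation. It assumes each read has already been annotated with two binary events (here the hypothetical keys `distal_tss` and `intron1_retained`), tabulates how often they co-occur on the same read, and applies a two-sided Fisher's exact test. The abstract does not state which statistical test ASAPA uses, so the choice of test, the function names and the toy counts are all assumptions made for illustration.

```python
from collections import Counter
from math import comb


def contingency_table(reads, event_a, event_b):
    """Build a 2x2 table of read counts for two binary per-read events.

    Rows: event_a present / absent. Columns: event_b present / absent.
    """
    counts = Counter((bool(r[event_a]), bool(r[event_b])) for r in reads)
    return [
        [counts[(True, True)], counts[(True, False)]],
        [counts[(False, True)], counts[(False, False)]],
    ]


def fisher_exact_two_sided(table):
    """Two-sided Fisher's exact test p-value for a 2x2 table.

    Sums the hypergeometric probabilities of all tables with the same
    margins that are no more likely than the observed table.
    """
    (a, b), (c, d) = table
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Probability that the top-left cell equals x given fixed margins.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    total = sum(
        prob(x) for x in range(lo, hi + 1) if prob(x) <= p_obs * (1 + 1e-9)
    )
    return min(1.0, total)


# Toy, made-up reads for one gene (hypothetical annotation keys).
reads = (
    [{"distal_tss": True, "intron1_retained": True}] * 8
    + [{"distal_tss": True, "intron1_retained": False}] * 2
    + [{"distal_tss": False, "intron1_retained": True}] * 1
    + [{"distal_tss": False, "intron1_retained": False}] * 9
)

table = contingency_table(reads, "distal_tss", "intron1_retained")
p_value = fisher_exact_two_sided(table)
print(table, p_value)
```

A small p-value in this toy case would suggest that the choice of transcription start site and retention of the first intron are not independent across reads. In a real analysis, a test like this would be repeated over many event pairs and genes, so the resulting p-values would need multiple-testing correction.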

Results: We created the ASAPA pipeline, which enables streamlined analysis for robust detection of potential links among AS, ATI and APA using Iso-Seq data. We tested this pipeline on Arabidopsis data and found that some adjacent introns tend to be simultaneously spliced or retained; that coupling between AS and ATI or APA is limited to the initial or terminal intron; and that ATI and APA are potentially linked in some special cases.

Conclusion: Our pipeline enables streamlined analysis for robust detection of potential links among AS, ATI and APA using Iso-Seq data, which should contribute to a better understanding of how the transcriptional landscape is generated.

Keywords: Alternative polyadenylation; Alternative splicing; Alternative transcription initiation; Correlation analysis.

MeSH terms

  • Alternative Splicing*
  • Computational Biology
  • High-Throughput Nucleotide Sequencing
  • Polyadenylation*
  • Protein Isoforms / genetics

Substances

  • Protein Isoforms

Grants and funding