FLAME: long-read bioinformatics tool for comprehensive spliceome characterization

RNA. 2021 Oct;27(10):1127-1139. doi: 10.1261/rna.078800.121. Epub 2021 Jul 12.

Abstract

Comprehensive characterization of differentially spliced RNA transcripts with nanopore sequencing is limited by bioinformatics tools that are reliant on existing annotations. We have developed FLAME, a bioinformatics pipeline for alternative splicing analysis of gene-specific or transcriptome-wide long-read sequencing data. FLAME is a Python-based tool aimed at providing comprehensible quantification of full-length splice variants, reliable de novo recognition of splice sites and exons, and representation of consecutive exon connectivity in the form of a weighted adjacency matrix. Notably, this workflow circumvents issues related to inadequate reference annotations and allows for incorporation of short-read sequencing data to improve the confidence of nanopore sequencing reads. In this study, the Epstein-Barr virus long noncoding RNA RPMS1 was used to demonstrate the utility of the pipeline. RPMS1 is ubiquitously expressed in Epstein-Barr virus associated cancer and known to undergo ample differential splicing. To fully resolve the RPMS1 spliceome, we combined gene-specific nanopore sequencing reads from a primary gastric adenocarcinoma and a nasopharyngeal carcinoma cell line with matched publicly available short-read sequencing data sets. All previously reported splice variants, including putative ORFs, were detected using FLAME. In addition, 32 novel exons, including two intron retentions and a cassette exon, were discovered within the RPMS1 gene.

Keywords: Epstein-Barr virus; RNA splicing; RPMS1; bioinformatics.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Benchmarking
  • Cell Line, Tumor
  • Computational Biology / methods
  • Epstein-Barr Virus Infections / genetics*
  • Epstein-Barr Virus Infections / metabolism
  • Epstein-Barr Virus Infections / pathology
  • Epstein-Barr Virus Infections / virology
  • Exons
  • Herpesvirus 4, Human / genetics*
  • Herpesvirus 4, Human / pathogenicity
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Introns
  • Nanopore Sequencing
  • Nasopharyngeal Carcinoma / genetics*
  • Nasopharyngeal Carcinoma / metabolism
  • Nasopharyngeal Carcinoma / pathology
  • Nasopharyngeal Carcinoma / virology
  • Nasopharyngeal Neoplasms / genetics*
  • Nasopharyngeal Neoplasms / metabolism
  • Nasopharyngeal Neoplasms / pathology
  • Nasopharyngeal Neoplasms / virology
  • RNA Splicing*
  • RNA, Long Noncoding / genetics*
  • RNA, Long Noncoding / metabolism
  • RNA, Messenger / genetics*
  • RNA, Messenger / metabolism
  • RNA, Viral / genetics
  • RNA, Viral / metabolism
  • Sequence Analysis, RNA
  • Software*

Substances

  • RNA, Long Noncoding
  • RNA, Messenger
  • RNA, Viral