Discovering chimeric transcripts in paired-end RNA-seq data by using EricScript

Bioinformatics. 2012 Dec 15;28(24):3232-9. doi: 10.1093/bioinformatics/bts617. Epub 2012 Oct 23.

Abstract

Motivation: The discovery of novel gene fusions can lead to a better comprehension of cancer progression and development. The emergence of deep sequencing of trancriptome, known as RNA-seq, has opened many opportunities for the identification of this class of genomic alterations, leading to the discovery of novel chimeric transcripts in melanomas, breast cancers and lymphomas. Nowadays, few computational approaches have been developed for the detection of chimeric transcripts. Although all of these computational methods show good sensitivity, much work remains to reduce the huge number of false-positive calls that arises from this analysis.

Results: We proposed a novel computational framework, named chimEric tranScript detection algorithm (EricScript), for the identification of gene fusion products in paired-end RNA-seq data. Our simulation study on synthetic data demonstrates that EricScript enables to achieve higher sensitivity and specificity than existing methods with noticeably lower running times. We also applied our method to publicly available RNA-seq tumour datasets, and we showed its capability in rediscovering known gene fusions.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Cell Line, Tumor
  • Gene Expression Profiling
  • Gene Fusion*
  • Genomics
  • High-Throughput Nucleotide Sequencing*
  • Humans
  • RNA, Neoplasm / chemistry*
  • Sequence Alignment
  • Sequence Analysis, RNA / methods*

Substances

  • RNA, Neoplasm