A convex formulation for joint RNA isoform detection and quantification from multiple RNA-seq samples

BMC Bioinformatics. 2015 Aug 19;16:262. doi: 10.1186/s12859-015-0695-9.

Abstract

Background: Detecting and quantifying isoforms from RNA-seq data is an important but challenging task. The problem is often ill-posed, particularly at low coverage. One promising direction is to exploit several samples simultaneously.

Results: We propose a new method for solving the isoform deconvolution problem jointly across several samples. We formulate a convex optimization problem that shares information between samples and that can be solved efficiently. We demonstrate the benefits of combining several samples on simulated and real data, and show that our approach outperforms pooling strategies and methods based on integer programming.
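
To make the idea of a joint convex formulation concrete, the following is a minimal sketch, not the authors' implementation. The abstract does not state the loss or the penalty, so the squared-error loss, the group-lasso penalty over samples, the nonnegativity constraint, and all names (`joint_isoform_fit`, `X`, `Y`, `lam`) are assumptions chosen for illustration. Each row of `B` holds one candidate isoform's abundance across samples, and penalizing the row norms encourages an isoform to be selected or discarded in all samples together.

```python
import numpy as np

def joint_isoform_fit(X, Y, lam, n_iter=500):
    """Proximal gradient sketch (assumed model, not the paper's exact one).

    Minimizes 0.5 * ||Y - X @ B||_F^2 + lam * sum_j ||B[j, :]||_2 with B >= 0.
    X: (n_bins, n_candidates) candidate-isoform design matrix (hypothetical).
    Y: (n_bins, n_samples) observed coverage, one column per sample.
    """
    # Step size 1/L, where L is the Lipschitz constant of the smooth gradient.
    step = 1.0 / (np.linalg.norm(X, 2) ** 2)
    B = np.zeros((X.shape[1], Y.shape[1]))
    for _ in range(n_iter):
        # Gradient step on the squared-error term.
        V = B - step * (X.T @ (X @ B - Y))
        # Proximal step: clip to nonnegative values, then shrink each row norm.
        V = np.maximum(V, 0.0)
        norms = np.linalg.norm(V, axis=1, keepdims=True)
        scale = np.maximum(1.0 - step * lam / np.maximum(norms, 1e-12), 0.0)
        B = scale * V
    return B

def objective(X, Y, B, lam):
    """Value of the assumed penalized objective at B."""
    return 0.5 * np.sum((Y - X @ B) ** 2) + lam * np.sum(np.linalg.norm(B, axis=1))

# Toy data: 4 bins, 3 candidate isoforms, 2 samples (all values made up).
X = np.array([[1.0, 1.0, 1.0],
              [1.0, 0.0, 1.0],
              [1.0, 1.0, 0.0],
              [1.0, 1.0, 1.0]])
B_true = np.array([[2.0, 1.0],
                   [1.0, 3.0],
                   [0.0, 0.0]])  # third candidate is absent from both samples
Y = X @ B_true

B_hat = joint_isoform_fit(X, Y, lam=0.1)
B_null = joint_isoform_fit(X, Y, lam=1e6)  # a very large penalty removes every isoform
```

Because the penalty acts on whole rows of `B`, the samples are coupled only through which isoforms are retained, while each sample keeps its own abundance values. This is one way to contrast a joint fit with pooling, which would collapse the columns of `Y` into a single sample before fitting.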

Conclusion: Our convex formulation for jointly detecting and quantifying isoforms from RNA-seq data of multiple related samples is a computationally efficient way to leverage the hypothesis that some isoforms are likely to be present in several samples. The software and source code are available at http://cbio.ensmp.fr/flipflop.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Alternative Splicing
  • Humans
  • Internet
  • RNA / metabolism*
  • RNA Isoforms / analysis*
  • RNA Isoforms / metabolism
  • Sequence Analysis, RNA
  • Transcriptome
  • User-Computer Interface

Substances

  • RNA Isoforms
  • RNA