Augmenting transcriptome assembly by combining de novo and genome-guided tools

PeerJ. 2013 Aug 15:1:e133. doi: 10.7717/peerj.133. eCollection 2013.

Abstract

Researchers interested in studying and constructing transcriptomes, especially for non-model species, face the conundrum of choosing from a number of available de novo and genome-guided assemblers. None of the popular assembly tools in use today achieve requisite sensitivity, specificity or recovery of full-length transcripts on their own. Here, we present a comprehensive comparative study of the performance of various assemblers. Additionally, we present an approach to combinatorially augment transciptome assembly by using both de novo and genome-guided tools. In our study, we obtained the best recovery and most full-length transcripts with Trinity and TopHat1-Cufflinks, respectively. The sensitivity of the assembly and isoform recovery was superior, without compromising much on the specificity, when transcripts from Trinity were augmented with those from TopHat1-Cufflinks.

Keywords: De novo transcriptome assembly; Genome-guided transcriptome assembly; Misassembly; Model assembly; Transcriptome; Transcriptome assembly.

Grants and funding

This research is funded by the Department of Electronics and Information Technology, Government of India (Ref No: 18(4)/2010-E-Infra., 31-03-2010) and the Department of Information Technology, Biotechnology & Science and Technology, Government of Karnataka (Ref No: 3451-00-090-2-22). Both the grants were received by the Bio-IT Centre. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.