Annocript: a flexible pipeline for the annotation of transcriptomes able to identify putative long noncoding RNAs

Bioinformatics. 2015 Jul 1;31(13):2199-201. doi: 10.1093/bioinformatics/btv106. Epub 2015 Feb 19.

Abstract

The eukaryotic transcriptome is composed of thousands of coding and long non-coding RNAs (lncRNAs). However, we lack a software platform to identify both RNA classes in a given transcriptome. Here we introduce Annocript, a pipeline that combines the annotation of protein-coding transcripts with the prediction of putative lncRNAs in whole transcriptomes. It downloads and indexes the needed databases, runs the analysis, and produces human-readable and standard outputs together with summary statistics of the whole analysis.

Availability and implementation: Annocript is distributed under the GNU General Public License (version 3 or later) and is freely available at https://github.com/frankMusacchia/Annocript.

Contact: remo.sanges@szn.it.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Humans
  • Molecular Sequence Annotation*
  • RNA, Long Noncoding / genetics*
  • Sequence Analysis, RNA / methods*
  • Software*
  • Transcriptome*

Substances

  • RNA, Long Noncoding