LeARN: a platform for detecting, clustering and annotating non-coding RNAs

Céline Noirot; Christine Gaspin; Thomas Schiex; Jérôme Gouzy

doi:10.1186/1471-2105-9-21

LeARN: a platform for detecting, clustering and annotating non-coding RNAs

BMC Bioinformatics. 2008 Jan 14:9:21. doi: 10.1186/1471-2105-9-21.

Authors

Céline Noirot¹, Christine Gaspin, Thomas Schiex, Jérôme Gouzy

Affiliation

¹ Laboratoire Interactions Plantes Micro-organismes UMR441/2594, INRA/CNRS, F-31320 Castanet Tolosan, France. Celine.Noirot@toulouse.inra.fr

Abstract

Background: In the last decade, sequencing projects have led to the development of a number of annotation systems dedicated to the structural and functional annotation of protein-coding genes. These annotation systems manage the annotation of the non-protein coding genes (ncRNAs) in a very crude way, allowing neither the edition of the secondary structures nor the clustering of ncRNA genes into families which are crucial for appropriate annotation of these molecules.

Results: LeARN is a flexible software package which handles the complete process of ncRNA annotation by integrating the layers of automatic detection and human curation.

Conclusion: This software provides the infrastructure to deal properly with ncRNAs in the framework of any annotation project. It fills the gap between existing prediction software, that detect independent ncRNA occurrences, and public ncRNA repositories, that do not offer the flexibility and interactivity required for annotation projects. The software is freely available from the download section of the website http://bioinfo.genopole-toulouse.prd.fr/LeARN.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Amino Acid Sequence
Cluster Analysis
Database Management Systems*
Databases, Nucleic Acid / organization & administration
Genome, Archaeal
Genomics / methods
Knowledge Bases
Multigene Family
Pattern Recognition, Automated
Pyrococcus abyssi / genetics
Pyrococcus furiosus / genetics
Pyrococcus horikoshii / genetics
RNA, Untranslated / analysis*
Sequence Alignment
Thermococcus / genetics
User-Computer Interface*

Substances

RNA, Untranslated