LeARN: a platform for detecting, clustering and annotating non-coding RNAs

BMC Bioinformatics. 2008 Jan 14:9:21. doi: 10.1186/1471-2105-9-21.

Abstract

Background: In the last decade, sequencing projects have led to the development of a number of annotation systems dedicated to the structural and functional annotation of protein-coding genes. These annotation systems manage the annotation of the non-protein coding genes (ncRNAs) in a very crude way, allowing neither the edition of the secondary structures nor the clustering of ncRNA genes into families which are crucial for appropriate annotation of these molecules.

Results: LeARN is a flexible software package which handles the complete process of ncRNA annotation by integrating the layers of automatic detection and human curation.

Conclusion: This software provides the infrastructure to deal properly with ncRNAs in the framework of any annotation project. It fills the gap between existing prediction software, that detect independent ncRNA occurrences, and public ncRNA repositories, that do not offer the flexibility and interactivity required for annotation projects. The software is freely available from the download section of the website http://bioinfo.genopole-toulouse.prd.fr/LeARN.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Cluster Analysis
  • Database Management Systems*
  • Databases, Nucleic Acid / organization & administration
  • Genome, Archaeal
  • Genomics / methods
  • Knowledge Bases
  • Multigene Family
  • Pattern Recognition, Automated
  • Pyrococcus abyssi / genetics
  • Pyrococcus furiosus / genetics
  • Pyrococcus horikoshii / genetics
  • RNA, Untranslated / analysis*
  • Sequence Alignment
  • Thermococcus / genetics
  • User-Computer Interface*

Substances

  • RNA, Untranslated