Ontological function annotation of long non-coding RNAs through hierarchical multi-label classification

Bioinformatics. 2018 May 15;34(10):1750-1757. doi: 10.1093/bioinformatics/btx833.

Abstract

Motivation: Long non-coding RNAs (lncRNAs) are an enormous collection of functional non-coding RNAs. Over the past decades, a large number of novel lncRNA genes have been identified. However, most of the lncRNAs remain function uncharacterized at present. Computational approaches provide a new insight to understand the potential functional implications of lncRNAs.

Results: Considering that each lncRNA may have multiple functions and a function may be further specialized into sub-functions, here we describe NeuraNetL2GO, a computational ontological function prediction approach for lncRNAs using hierarchical multi-label classification strategy based on multiple neural networks. The neural networks are incrementally trained level by level, each performing the prediction of gene ontology (GO) terms belonging to a given level. In NeuraNetL2GO, we use topological features of the lncRNA similarity network as the input of the neural networks and employ the output results to annotate the lncRNAs. We show that NeuraNetL2GO achieves the best performance and the overall advantage in maximum F-measure and coverage on the manually annotated lncRNA2GO-55 dataset compared to other state-of-the-art methods.

Availability and implementation: The source code and data are available at http://denglab.org/NeuraNetL2GO/.

Contact: leideng@csu.edu.cn.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Humans
  • Molecular Sequence Annotation
  • Neoplasms / genetics*
  • Neural Networks, Computer
  • RNA, Long Noncoding / genetics*
  • Software

Substances

  • RNA, Long Noncoding