Long non-coding RNAs function annotation: a global prediction method based on bi-colored networks

Nucleic Acids Res. 2013 Jan;41(2):e35. doi: 10.1093/nar/gks967. Epub 2012 Nov 5.

Abstract

More and more evidences demonstrate that the long non-coding RNAs (lncRNAs) play many key roles in diverse biological processes. There is a critical need to annotate the functions of increasing available lncRNAs. In this article, we try to apply a global network-based strategy to tackle this issue for the first time. We develop a bi-colored network based global function predictor, long non-coding RNA global function predictor ('lnc-GFP'), to predict probable functions for lncRNAs at large scale by integrating gene expression data and protein interaction data. The performance of lnc-GFP is evaluated on protein-coding and lncRNA genes. Cross-validation tests on protein-coding genes with known function annotations indicate that our method can achieve a precision up to 95%, with a suitable parameter setting. Among the 1713 lncRNAs in the bi-colored network, the 1625 (94.9%) lncRNAs in the maximum connected component are all functionally characterized. For the lncRNAs expressed in mouse embryo stem cells and neuronal cells, the inferred putative functions by our method highly match those in the known literature.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Brain / metabolism
  • Embryonic Stem Cells / metabolism
  • Gene Expression
  • Humans
  • Mice
  • Molecular Sequence Annotation / methods*
  • Neurons / metabolism
  • Protein Interaction Maps
  • RNA, Long Noncoding / metabolism
  • RNA, Long Noncoding / physiology*

Substances

  • RNA, Long Noncoding