A novel algorithm based on bi-random walks to identify disease-related lncRNAs

BMC Bioinformatics. 2019 Nov 25;20(Suppl 18):569. doi: 10.1186/s12859-019-3128-3.

Abstract

Backgrounds: There is evidence to suggest that lncRNAs are associated with distinct and diverse biological processes. The dysfunction or mutation of lncRNAs are implicated in a wide range of diseases. An accurate computational model can benefit the diagnosis of diseases and help us to gain a better understanding of the molecular mechanism. Although many related algorithms have been proposed, there is still much room to improve the accuracy of the algorithm.

Results: We developed a novel algorithm, BiWalkLDA, to predict disease-related lncRNAs in three real datasets, which have 528 lncRNAs, 545 diseases and 1216 interactions in total. To compare performance with other algorithms, the leave-one-out validation test was performed for BiWalkLDA and three other existing algorithms, SIMCLDA, LDAP and LRLSLDA. Additional tests were carefully designed to analyze the parameter effects such as α, β, l and r, which could help user to select the best choice of these parameters in their own application. In a case study of prostate cancer, eight out of the top-ten disease-related lncRNAs reported by BiWalkLDA were previously confirmed in literatures.

Conclusions: In this paper, we develop an algorithm, BiWalkLDA, to predict lncRNA-disease association by using bi-random walks. It constructs a lncRNA-disease network by integrating interaction profile and gene ontology information. Solving cold-start problem by using neighbors' interaction profile information. Then, bi-random walks was applied to three real biological datasets. Results show that our method outperforms other algorithms in predicting lncRNA-disease association in terms of both accuracy and specificity.

Availability: https://github.com/screamer/BiwalkLDA.

Keywords: Bi-random walks; Gene ontology; Interaction profile; LncRNA-disease association.

Publication types

  • Evaluation Study

MeSH terms

  • Algorithms
  • Computational Biology / methods*
  • Computer Simulation
  • Disease / genetics*
  • Gene Ontology
  • Humans
  • RNA, Long Noncoding / genetics*
  • Software

Substances

  • RNA, Long Noncoding