Predicting Disease Related microRNA Based on Similarity and Topology

Cells. 2019 Nov 7;8(11):1405. doi: 10.3390/cells8111405.

Abstract

It is known that many diseases are caused by mutations or abnormalities in microRNA (miRNA). The usual method to predict miRNA disease relationships is to build a high-quality similarity network of diseases and miRNAs. All unobserved associations are ranked by their similarity scores, such that a higher score indicates a greater probability of a potential connection. However, this approach does not utilize information within the network. Therefore, in this study, we propose a machine learning method, called STIM, which uses network topology information to predict disease-miRNA associations. In contrast to the conventional approach, STIM constructs features according to information on similarity and topology in networks and then uses a machine learning model to predict potential associations. To verify the reliability and accuracy of our method, we compared STIM to other classical algorithms. The results of fivefold cross validation demonstrated that STIM outperforms many existing methods, particularly in terms of the area under the curve. In addition, the top 30 candidate miRNAs recommended by STIM in a case study of lung neoplasm have been confirmed in previous experiments, which proved the validity of the method.

Keywords: heterogeneous network; link prediction; machine learning; miRNA; network embedding; topology information.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Biomarkers
  • Computational Biology* / methods
  • Data Mining
  • Databases, Genetic
  • Gene Ontology
  • Gene Regulatory Networks
  • Genetic Predisposition to Disease*
  • Humans
  • MicroRNAs / genetics*
  • Prognosis
  • ROC Curve
  • Reproducibility of Results

Substances

  • Biomarkers
  • MicroRNAs