Predicting miRNA-disease association from heterogeneous information network with GraRep embedding model

Sci Rep. 2020 Apr 20;10(1):6658. doi: 10.1038/s41598-020-63735-9.

Abstract

In recent years, accumulating evidences have shown that microRNA (miRNA) plays an important role in the exploration and treatment of diseases, so detection of the associations between miRNA and disease has been drawn more and more attentions. However, traditional experimental methods have the limitations of high cost and time- consuming, a computational method can help us more systematically and effectively predict the potential miRNA-disease associations. In this work, we proposed a novel network embedding-based heterogeneous information integration method to predict miRNA-disease associations. More specifically, a heterogeneous information network is constructed by combining the known associations among lncRNA, drug, protein, disease, and miRNA. After that, the network embedding method Learning Graph Representations with Global Structural Information (GraRep) is employed to learn embeddings of nodes in heterogeneous information network. In this way, the embedding representations of miRNA and disease are integrated with the attribute information of miRNA and disease (e.g. miRNA sequence information and disease semantic similarity) to represent miRNA-disease association pairs. Finally, the Random Forest (RF) classifier is used for predicting potential miRNA-disease associations. Under the 5-fold cross validation, our method obtained 85.11% prediction accuracy with 80.41% sensitivity at the AUC of 91.25%. In addition, in case studies of three major Human diseases, 45 (Colon Neoplasms), 42 (Breast Neoplasms) and 44 (Esophageal Neoplasms) of top-50 predicted miRNAs are respectively verified by other miRNA-disease association databases. In conclusion, the experimental results suggest that our method can be a powerful and useful tool for predicting potential miRNA-disease associations.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Antineoplastic Agents / metabolism
  • Antineoplastic Agents / pharmacokinetics
  • Breast Neoplasms / diagnosis
  • Breast Neoplasms / drug therapy
  • Breast Neoplasms / genetics*
  • Breast Neoplasms / pathology
  • Colonic Neoplasms / diagnosis
  • Colonic Neoplasms / drug therapy
  • Colonic Neoplasms / genetics*
  • Colonic Neoplasms / pathology
  • Computational Biology / methods
  • Databases, Genetic
  • Decision Trees
  • Esophageal Neoplasms / diagnosis
  • Esophageal Neoplasms / drug therapy
  • Esophageal Neoplasms / genetics*
  • Esophageal Neoplasms / pathology
  • Female
  • Humans
  • Male
  • MicroRNAs / classification
  • MicroRNAs / genetics*
  • MicroRNAs / metabolism
  • Models, Genetic
  • RNA, Circular / classification
  • RNA, Circular / genetics*
  • RNA, Circular / metabolism
  • RNA, Long Noncoding / classification
  • RNA, Long Noncoding / genetics*
  • RNA, Long Noncoding / metabolism
  • RNA, Messenger / classification
  • RNA, Messenger / genetics*
  • RNA, Messenger / metabolism
  • RNA, Neoplasm / classification
  • RNA, Neoplasm / genetics*
  • RNA, Neoplasm / metabolism

Substances

  • Antineoplastic Agents
  • MicroRNAs
  • RNA, Circular
  • RNA, Long Noncoding
  • RNA, Messenger
  • RNA, Neoplasm