SCMFMDA: Predicting microRNA-disease associations based on similarity constrained matrix factorization

PLoS Comput Biol. 2021 Jul 12;17(7):e1009165. doi: 10.1371/journal.pcbi.1009165. eCollection 2021 Jul.

Abstract

miRNAs belong to small non-coding RNAs that are related to a number of complicated biological processes. Considerable studies have suggested that miRNAs are closely associated with many human diseases. In this study, we proposed a computational model based on Similarity Constrained Matrix Factorization for miRNA-Disease Association Prediction (SCMFMDA). In order to effectively combine different disease and miRNA similarity data, we applied similarity network fusion algorithm to obtain integrated disease similarity (composed of disease functional similarity, disease semantic similarity and disease Gaussian interaction profile kernel similarity) and integrated miRNA similarity (composed of miRNA functional similarity, miRNA sequence similarity and miRNA Gaussian interaction profile kernel similarity). In addition, the L2 regularization terms and similarity constraint terms were added to traditional Nonnegative Matrix Factorization algorithm to predict disease-related miRNAs. SCMFMDA achieved AUCs of 0.9675 and 0.9447 based on global Leave-one-out cross validation and five-fold cross validation, respectively. Furthermore, the case studies on two common human diseases were also implemented to demonstrate the prediction accuracy of SCMFMDA. The out of top 50 predicted miRNAs confirmed by experimental reports that indicated SCMFMDA was effective for prediction of relationship between miRNAs and diseases.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Computational Biology
  • Disease* / classification
  • Disease* / genetics
  • Humans
  • MicroRNAs* / analysis
  • MicroRNAs* / classification
  • MicroRNAs* / genetics
  • Models, Statistical*

Substances

  • MicroRNAs

Grants and funding

This work was supported by the National Natural Science Foundation of China through grants 61873001 (C.Z., Y.W.), U19A2064 (C.Z.), 61872220 (C.Z.) and 11701318 (Y.W.), the Natural Science Foundation of Shandong Province grant ZR2020KC022 (J.N., Y.W., Z.G., L.L) and the Open Project of Anhui Provincial Key Laboratory of Multimodal Cognitive Computation, Anhui University, No. MMC202006 (Y.W., L.L). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.