Prediction of microRNA-disease associations based on distance correlation set

BMC Bioinformatics. 2018 Apr 17;19(1):141. doi: 10.1186/s12859-018-2146-x.

Abstract

Background: Recently, numerous laboratory studies have indicated that many microRNAs (miRNAs) are involved in and associated with human diseases and can serve as potential biomarkers and drug targets. Therefore, developing effective computational models for the prediction of novel associations between diseases and miRNAs could be beneficial for achieving an understanding of disease mechanisms at the miRNA level and the interactions between diseases and miRNAs at the disease level. Thus far, only a few miRNA-disease association pairs are known, and models analyzing miRNA-disease associations based on lncRNA are limited.

Results: In this study, a new computational method based on a distance correlation set is developed to predict miRNA-disease associations (DCSMDA) by integrating known lncRNA-disease associations, known miRNA-lncRNA associations, disease semantic similarity, and various lncRNA and disease similarity measures. The novelty of DCSMDA is due to the construction of a miRNA-lncRNA-disease network, which reveals that DCSMDA can be applied to predict potential lncRNA-disease associations without requiring any known miRNA-disease associations. Although the implementation of DCSMDA does not require known disease-miRNA associations, the area under curve is 0.8155 in the leave-one-out cross validation. Furthermore, DCSMDA was implemented in case studies of prostatic neoplasms, lung neoplasms and leukaemia, and of the top 10 predicted associations, 10, 9 and 9 associations, respectively, were separately verified in other independent studies and biological experimental studies. In addition, 10 of the 10 (100%) associations predicted by DCSMDA were supported by recent bioinformatical studies.

Conclusions: According to the simulation results, DCSMDA can be a great addition to the biomedical research field.

Keywords: Disease-lncRNA-miRNA network; Distance correlation set; MiRNA-disease association predictions; Similarity measure.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Area Under Curve
  • Computational Biology
  • Databases, Genetic
  • Genetic Predisposition to Disease*
  • Humans
  • Male
  • MicroRNAs / genetics*
  • MicroRNAs / metabolism
  • Models, Genetic
  • Neoplasms / genetics
  • RNA, Long Noncoding / genetics
  • RNA, Long Noncoding / metabolism

Substances

  • MicroRNAs
  • RNA, Long Noncoding