PESM: predicting the essentiality of miRNAs based on gradient boosting machines and sequences

BMC Bioinformatics. 2020 Mar 18;21(1):111. doi: 10.1186/s12859-020-3426-9.

Abstract

Background: MicroRNAs (miRNAs) are small noncoding RNA molecules that directly regulate their mRNA targets at the posttranscriptional level. Studies have indicated that miRNAs play key roles in complex diseases by taking part in many biological processes, such as cell growth and cell death. Therefore, to improve the effectiveness of disease diagnosis and treatment, it is appealing to develop advanced computational methods for predicting the essentiality of miRNAs.

Results: In this study, we propose a method (PESM) to predict miRNA essentiality based on gradient boosting machines and miRNA sequences. First, PESM extracts the sequence and structural features of miRNAs. Then it uses gradient boosting machines to predict the essentiality of miRNAs. We conduct 5-fold cross-validation to assess the prediction performance of our method. The area under the receiver operating characteristic curve (AUC), F-measure and accuracy (ACC) are used as the evaluation metrics. We also compare PESM with three competing methods: miES, Gaussian Naive Bayes and Support Vector Machine.
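The two steps described above (feature extraction, then gradient boosting) can be illustrated with a minimal pure-Python sketch. It is not the PESM implementation: the k-mer frequency features, the decision-stump booster, the function names, and the toy sequences and labels are all assumptions made for illustration, and structural features and cross-validation are omitted.

```python
import math
from itertools import product

def kmer_features(seq, k=2):
    """Normalized k-mer frequencies (an assumed stand-in for sequence features)."""
    kmers = ["".join(p) for p in product("ACGU", repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    for i in range(len(seq) - k + 1):
        sub = seq[i:i + k]
        if sub in counts:  # skip k-mers containing non-ACGU symbols
            counts[sub] += 1
    total = max(1, len(seq) - k + 1)
    return [counts[m] / total for m in kmers]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_stump(X, residuals):
    """Pick the one-feature threshold split with the lowest squared error."""
    best = None
    for j in range(len(X[0])):
        for t in sorted(set(row[j] for row in X)):
            left = [r for row, r in zip(X, residuals) if row[j] <= t]
            right = [r for row, r in zip(X, residuals) if row[j] > t]
            if not left or not right:
                continue
            lm, rm = sum(left) / len(left), sum(right) / len(right)
            err = sum((r - lm) ** 2 for r in left) + sum((r - rm) ** 2 for r in right)
            if best is None or err < best[0]:
                best = (err, j, t, lm, rm)
    if best is None:  # no valid split: fall back to a constant prediction
        m = sum(residuals) / len(residuals)
        return (0, float("inf"), m, m)
    return best[1:]

def fit_gbm(X, y, n_rounds=30, lr=0.3):
    """Gradient boosting on log-loss: each stump fits the residuals y - p."""
    p0 = min(max(sum(y) / len(y), 1e-6), 1 - 1e-6)
    f0 = math.log(p0 / (1 - p0))  # initial score: log-odds of the base rate
    scores = [f0] * len(y)
    stumps = []
    for _ in range(n_rounds):
        residuals = [yi - sigmoid(s) for yi, s in zip(y, scores)]
        j, t, lm, rm = fit_stump(X, residuals)
        stumps.append((j, t, lm, rm))
        scores = [s + lr * (lm if row[j] <= t else rm) for s, row in zip(scores, X)]
    return f0, lr, stumps

def predict_proba(model, x):
    f0, lr, stumps = model
    z = f0 + sum(lr * (lm if x[j] <= t else rm) for j, t, lm, rm in stumps)
    return sigmoid(z)

# Hypothetical toy data: sequences and essentiality labels are made up.
seqs = ["GCGCGGCCGCGG", "GGCCGCGCGGCC", "CCGGGCGCCGCG",
        "AUAUUAAUAUUA", "UUAAUAUAUUAA", "AAUUAUAUAAUU"]
labels = [1, 1, 1, 0, 0, 0]

X = [kmer_features(s) for s in seqs]
model = fit_gbm(X, labels)
for s, x in zip(seqs, X):
    print(s, round(predict_proba(model, x), 3))
```

Each boosting round fits a stump to the current residuals and adds a damped copy of it to the running score, which is the core idea behind gradient boosting machines. A practical implementation would use deeper trees, regularization and held-out folds rather than training-set predictions.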

Conclusion: The experimental results show that PESM achieves better prediction performance (AUC: 0.9117, F-measure: 0.8572, ACC: 0.8516) than the three competing methods. In addition, the relative importance of the features shows that the newly added features help improve prediction performance.
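For readers unfamiliar with the three reported metrics, the following sketch shows how AUC, F-measure and ACC can be computed from true labels and predicted scores. The labels, scores and the 0.5 decision threshold are hypothetical values chosen for illustration and are unrelated to the results reported above.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def f_measure(y_true, y_pred):
    """Harmonic mean of precision and recall for the positive class."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

def auc(y_true, scores):
    """Probability that a random positive outscores a random negative (ties count 0.5)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Hypothetical labels and predicted scores, for illustration only.
y_true = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.6, 0.3, 0.1]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed 0.5 threshold

print(f"ACC: {accuracy(y_true, y_pred):.3f}")         # ACC: 0.667
print(f"F-measure: {f_measure(y_true, y_pred):.3f}")  # F-measure: 0.667
print(f"AUC: {auc(y_true, scores):.3f}")              # AUC: 0.889
```

ACC and F-measure depend on the chosen decision threshold, whereas AUC is computed from the ranking of scores alone, which is why the three are often reported together.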

Keywords: Essentiality; Gradient boosting machines; MiRNA.

Publication types

  • Evaluation Study

MeSH terms

  • Animals
  • Bayes Theorem
  • Computational Biology / methods*
  • Humans
  • Mice
  • MicroRNAs / chemistry
  • MicroRNAs / genetics*
  • Rats
  • Support Vector Machine

Substances

  • MicroRNAs