A learning based framework for diverse biomolecule relationship prediction in molecular association network

Commun Biol. 2020 Mar 13;3(1):118. doi: 10.1038/s42003-020-0858-8.

Abstract

Abundant life activities are maintained by various biomolecule relationships in human cells. However, many previous computational models only focus on isolated objects, without considering that cell is a complete entity with ample functions. Inspired by holism, we constructed a Molecular Associations Network (MAN) including 9 kinds of relationships among 5 types of biomolecules, and a prediction model called MAN-GF. More specifically, biomolecules can be represented as vectors by the algorithm called biomarker2vec which combines 2 kinds of information involved the attribute learned by k-mer, etc and the behavior learned by Graph Factorization (GF). Then, Random Forest classifier is applied for training, validation and test. MAN-GF obtained a substantial performance with AUC of 0.9647 and AUPR of 0.9521 under 5-fold Cross-validation. The results imply that MAN-GF with an overall perspective can act as ancillary for practice. Besides, it holds great hope to provide a new insight to elucidate the regulatory mechanisms.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Area Under Curve
  • Colonic Neoplasms / metabolism*
  • Computational Biology / methods*
  • Data Accuracy
  • Data Mining / methods
  • Humans
  • MicroRNAs / metabolism*
  • Models, Biological*
  • Protein Interaction Maps*
  • Proteins / metabolism*
  • RNA, Long Noncoding / metabolism*
  • ROC Curve
  • Sensitivity and Specificity

Substances

  • MicroRNAs
  • Proteins
  • RNA, Long Noncoding