Machine learning-based chemical binding similarity using evolutionary relationships of target genes

Nucleic Acids Res. 2019 Nov 18;47(20):e128. doi: 10.1093/nar/gkz743.

Abstract

Chemical similarity searching is a basic research tool that can be used to find small molecules which are similar in shape to known active molecules. Despite its popularity, the retrieval of local molecular features that are critical to functional activity related to target binding often fails. To overcome this limitation, we developed a novel machine learning-based chemical binding similarity score by using various evolutionary relationships of binding targets. The chemical similarity was defined by the probability of chemical compounds binding to identical targets. Comprehensive and heterogeneous multiple target-binding chemical data were integrated into a paired data format and processed using multiple classification similarity-learning models with various levels of target evolutionary information. Encoding evolutionary information to chemical compounds through their binding targets substantially expanded available chemical-target interaction data and significantly improved model performance. The output probability of our integrated model, referred to as ensemble evolutionary chemical binding similarity (ensECBS), was effective for finding hidden chemical relationships. The developed method can serve as a novel chemical similarity tool that uses evolutionarily conserved target binding information.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Evolution, Molecular*
  • Genes
  • Humans
  • Machine Learning*
  • Protein Binding
  • Protein Kinase Inhibitors / chemistry*
  • Protein Kinase Inhibitors / pharmacology
  • Protein Kinases / chemistry
  • Protein Kinases / genetics
  • Protein Kinases / metabolism*
  • Quantitative Structure-Activity Relationship
  • Sequence Analysis, Protein / methods*
  • Small Molecule Libraries / chemistry*
  • Small Molecule Libraries / pharmacology

Substances

  • Protein Kinase Inhibitors
  • Small Molecule Libraries
  • Protein Kinases