Classifying "kinase inhibitor-likeness" by using machine-learning methods

Hans Briem; Judith Günther

doi:10.1002/cbic.200400109

Classifying "kinase inhibitor-likeness" by using machine-learning methods

Chembiochem. 2005 Mar;6(3):558-66. doi: 10.1002/cbic.200400109.

Authors

Hans Briem¹, Judith Günther

Affiliation

¹ Schering AG, Research Center Europe, CDCC/Computational Chemistry, Muellerstrasse 178, 13342 Berlin, Germany. hans.briem@schering.de

PMID: 15696507
DOI: 10.1002/cbic.200400109

Abstract

By using an in-house data set of small-molecule structures, encoded by Ghose-Crippen parameters, several machine learning techniques were applied to distinguish between kinase inhibitors and other molecules with no reported activity on any protein kinase. All four approaches pursued--support-vector machines (SVM), artificial neural networks (ANN), k nearest neighbor classification with GA-optimized feature selection (GA/kNN), and recursive partitioning (RP)--proved capable of providing a reasonable discrimination. Nevertheless, substantial differences in performance among the methods were observed. For all techniques tested, the use of a consensus vote of the 13 different models derived improved the quality of the predictions in terms of accuracy, precision, recall, and F1 value. Support-vector machines, followed by the GA/kNN combination, outperformed the other techniques when comparing the average of individual models. By using the respective majority votes, the prediction of neural networks yielded the highest F1 value, followed by SVMs.

MeSH terms

Algorithms
Computational Biology / methods*
Computer Simulation*
Molecular Structure
Neural Networks, Computer
Protein Kinase Inhibitors / chemistry*
Protein Kinase Inhibitors / classification*
Reproducibility of Results
Sensitivity and Specificity

Substances

Protein Kinase Inhibitors