Machine learning in virtual screening

James L Melville; Edmund K Burke; Jonathan D Hirst

doi:10.2174/138620709788167980

Machine learning in virtual screening

Comb Chem High Throughput Screen. 2009 May;12(4):332-43. doi: 10.2174/138620709788167980.

Authors

James L Melville¹, Edmund K Burke, Jonathan D Hirst

Affiliation

¹ School of Chemistry, University of Nottingham, University Park, Nottingham, UK.

PMID: 19442063
DOI: 10.2174/138620709788167980

Abstract

In this review, we highlight recent applications of machine learning to virtual screening, focusing on the use of supervised techniques to train statistical learning algorithms to prioritize databases of molecules as active against a particular protein target. Both ligand-based similarity searching and structure-based docking have benefited from machine learning algorithms, including naïve Bayesian classifiers, support vector machines, neural networks, and decision trees, as well as more traditional regression techniques. Effective application of these methodologies requires an appreciation of data preparation, validation, optimization, and search methodologies, and we also survey developments in these areas.

Publication types

Research Support, Non-U.S. Gov't
Review

MeSH terms

Algorithms
Artificial Intelligence*
Computer Simulation
Databases, Factual
Drug Evaluation, Preclinical / methods*
Models, Chemical
Regression Analysis
Structure-Activity Relationship