Drug design by machine learning: the use of inductive logic programming to model the structure-activity relationships of trimethoprim analogues binding to dihydrofolate reductase

Proc Natl Acad Sci U S A. 1992 Dec 1;89(23):11322-6. doi: 10.1073/pnas.89.23.11322.

Abstract

The machine learning program GOLEM from the field of inductive logic programming was applied to the drug design problem of modeling structure-activity relationships. The training data for the program were 44 trimethoprim analogues and their observed inhibition of Escherichia coli dihydrofolate reductase. A further 11 compounds were used as unseen test data. GOLEM obtained rules that were statistically more accurate on the training data and also better on the test data than a Hansch linear regression model. Importantly machine learning yields understandable rules that characterized the chemistry of favored inhibitors in terms of polarity, flexibility, and hydrogen-bonding character. These rules agree with the stereochemistry of the interaction observed crystallographically.

MeSH terms

  • Artificial Intelligence*
  • Drug Design*
  • Escherichia coli / enzymology
  • Folic Acid Antagonists*
  • Molecular Structure
  • Structure-Activity Relationship
  • Trimethoprim / analogs & derivatives*
  • Trimethoprim / chemistry

Substances

  • Folic Acid Antagonists
  • Trimethoprim