Graph mining for SAR transfer series

J Chem Inf Model. 2012 Apr 23;52(4):935-42. doi: 10.1021/ci300071y. Epub 2012 Apr 3.

Abstract

The transfer of SAR information from one analog series to another is a difficult, yet highly attractive task in medicinal chemistry. At present, the evaluation of SAR transfer potential from a data mining perspective is still in its infancy. Only recently, a first computational approach has been introduced to evaluate SAR transfer events. Here, a substructure relationship-based molecular network representation has been used as a starting point to systematically identify SAR transfer series in large compound data sets. For this purpose, a methodology is introduced that consists of two stages. For graph mining, an algorithm has been designed that extracts all parallel series from compound data sets. A parallel series is formed by two series of analogs with different core structures but pairwise corresponding substitution patterns. The SAR transfer potential of identified parallel series is then evaluated using a scoring function that emphasizes corresponding potency progression over many analog pairs and large potency ranges. The substructure relationship-based molecular network in combination with the graph mining algorithm currently represents the only generally applicable approach to systematically detect SAR transfer events in large compound data sets. The combined approach has been evaluated on a large number of compound data sets and shown to systematically identify SAR transfer series.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Antithrombins / chemistry*
  • Chemistry, Pharmaceutical
  • Data Mining*
  • Databases, Chemical
  • Drug Design
  • Drug Discovery
  • Humans
  • Protein Binding
  • Research Design
  • Small Molecule Libraries / chemistry*
  • Structure-Activity Relationship*
  • Thrombin / antagonists & inhibitors
  • Thrombin / chemistry*

Substances

  • Antithrombins
  • Small Molecule Libraries
  • Thrombin