Extraction of discontinuous structure-activity relationships from compound data sets through particle swarm optimization

J Chem Inf Model. 2011 Jul 25;51(7):1545-51. doi: 10.1021/ci2001692. Epub 2011 Jun 24.

Abstract

The characterization of structure-activity relationship (SAR) features of large compound data sets has been a hot topic in recent years, and different methods for large-scale SAR analysis have been introduced. The exploration of local SAR components and prioritization of compound subsets have thus far mostly relied on graphical analysis methods that capture similarity and potency relationships in a systematic manner. A currently unsolved problem in large-scale SAR analysis is how to automatically select those compound subsets from large data sets that carry most SAR information. For this purpose, we introduce a numerical optimization scheme that is based on particle swarm optimization guided by an SAR scoring function. The methodology is applied to four large compound sets. We demonstrate that compound subsets representing the most discontinuous local SARs are consistently selected through particle swarm optimization.

MeSH terms

  • Animals
  • Computational Biology*
  • Computer Simulation*
  • Enzyme Inhibitors / chemistry*
  • Humans
  • Molecular Structure
  • Small Molecule Libraries*
  • Structure-Activity Relationship

Substances

  • Enzyme Inhibitors
  • Small Molecule Libraries