Conditional probabilities of activity landscape features for individual compounds

J Chem Inf Model. 2013 Jul 22;53(7):1602-12. doi: 10.1021/ci400288r. Epub 2013 Jul 9.

Abstract

Activity landscape representations aid in the analysis of structure-activity relationships (SARs) of large compound data sets. Landscapes are characterized by features with different SAR information content such as, for example, regions formed by structurally diverse compounds having similar activity or, alternatively, structurally similar compounds with large activity differences, so-called activity cliffs. Modeling of activity landscapes typically requires pairwise comparisons of molecular similarity and potency relationships of compounds in a data set. Consequently, landscape features are generally resolved at the level of compound pairs. Herein, we introduce a methodology to assign feature probabilities to individual compounds. This makes it possible to organize compounds comprising activity landscapes into well-defined SAR categories. Specifically, the calculation of conditional feature probabilities of active compounds provides a balanced and further refined view of activity landscapes with a focus on individual molecules.

MeSH terms

  • Drug Discovery
  • Informatics / methods*
  • Probability Theory*
  • Structure-Activity Relationship