Progress with modeling activity landscapes in drug discovery

Expert Opin Drug Discov. 2018 Jul;13(7):605-615. doi: 10.1080/17460441.2018.1465926. Epub 2018 Apr 19.

Abstract

Activity landscapes (ALs) are representations and models of compound data sets annotated with a target-specific activity. In contrast to quantitative structure-activity relationship (QSAR) models, ALs aim at characterizing structure-activity relationships (SARs) on a large-scale level encompassing all active compounds for specific targets. The popularity of AL modeling has grown substantially with the public availability of large activity-annotated compound data sets. AL modeling crucially depends on molecular representations and similarity metrics used to assess structural similarity. Areas covered: The concepts of AL modeling are introduced and its basis in quantitatively assessing molecular similarity is discussed. The different types of AL modeling approaches are introduced. AL designs can broadly be divided into three categories: compound-pair based, dimensionality reduction, and network approaches. Recent developments for each of these categories are discussed focusing on the application of mathematical, statistical, and machine learning tools for AL modeling. AL modeling using chemical space networks is covered in more detail. Expert opinion: AL modeling has remained a largely descriptive approach for the analysis of SARs. Beyond mere visualization, the application of analytical tools from statistics, machine learning and network theory has aided in the sophistication of AL designs and provides a step forward in transforming ALs from descriptive to predictive tools. To this end, optimizing representations that encode activity relevant features of molecules might prove to be a crucial step.

Keywords: Activity cliffs; activity landscapes; chemical space networks; compound data sets; molecular similarity; networks; structure-activity relationships.

Publication types

  • Review

MeSH terms

  • Drug Design*
  • Drug Discovery / methods*
  • Humans
  • Machine Learning
  • Models, Molecular*
  • Models, Statistical
  • Models, Theoretical
  • Quantitative Structure-Activity Relationship