QSAR-assisted-MMPA to expand chemical transformation space for lead optimization

Brief Bioinform. 2021 Sep 2;22(5):bbaa374. doi: 10.1093/bib/bbaa374.

Abstract

Matched molecular pairs analysis (MMPA) has become a powerful tool for automatically and systematically identifying medicinal chemistry transformations from compound/property datasets. However, accurate determination of matched molecular pair (MMP) transformations largely depend on the size and quality of existing experimental data. Lack of high-quality experimental data heavily hampers the extraction of more effective medicinal chemistry knowledge. Here, we developed a new strategy called quantitative structure-activity relationship (QSAR)-assisted-MMPA to expand the number of chemical transformations and took the logD7.4 property endpoint as an example to demonstrate the reliability of the new method. A reliable logD7.4 consensus prediction model was firstly established, and its applicability domain was strictly assessed. By applying the reliable logD7.4 prediction model to screen two chemical databases, we obtained more high-quality logD7.4 data by defining a strict applicability domain threshold. Then, MMPA was performed on the predicted data and experimental data to derive more chemical rules. To validate the reliability of the chemical rules, we compared the magnitude and directionality of the property changes of the predicted rules with those of the measured rules. Then, we compared the novel chemical rules generated by our proposed approach with the published chemical rules, and found that the magnitude and directionality of the property changes were consistent, indicating that the proposed QSAR-assisted-MMPA approach has the potential to enrich the collection of rule types or even identify completely novel rules. Finally, we found that the number of the MMP rules derived from the experimental data could be amplified by the predicted data, which is helpful for us to analyze the medicinal chemical rules in local chemical environment. In summary, the proposed QSAR-assisted-MMPA approach could be regarded as a very promising strategy to expand the chemical transformation space for lead optimization, especially when no enough experimental data can support MMPA.

Keywords: MMPA; QSAR; lead optimization; machine learning; medicinal chemical rules.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biotransformation
  • Chemistry Techniques, Synthetic / methods*
  • Chemistry, Pharmaceutical / methods*
  • Databases, Chemical
  • Datasets as Topic
  • Drug Discovery / methods*
  • Drug Discovery / statistics & numerical data
  • Drugs, Investigational / chemical synthesis*
  • Drugs, Investigational / metabolism
  • Humans
  • Models, Statistical*
  • Molecular Structure
  • Quantitative Structure-Activity Relationship
  • Reproducibility of Results

Substances

  • Drugs, Investigational