Botanical drugs: a new strategy for structure-based target prediction

Brief Bioinform. 2022 Jan 17;23(1):bbab425. doi: 10.1093/bib/bbab425.

Abstract

Target identification of small molecules is an important and still changeling work in the area of drug discovery, especially for botanical drug development. Indistinct understanding of the relationships of ligand-protein interactions is one of the main obstacles for drug repurposing and identification of off-targets. In this study, we collected 9063 crystal structures of ligand-binding proteins released from January, 1995 to April, 2021 in PDB bank, and split the complexes into 5133 interaction pairs of ligand atoms and protein fragments (covalently linked three heavy atoms) with interatomic distance ≤5 Å. The interaction pairs were grouped into ligand atoms with the same SYBYL atom type surrounding each type of protein fragment, which were further clustered via Bayesian Gaussian Mixture Model (BGMM). Gaussian distributions with ligand atoms ≥20 were identified as significant interaction patterns. Reliability of the significant interaction patterns was validated by comparing the difference of number of significant interaction patterns between the docked poses with higher and lower similarity to the native crystal structures. Fifty-one candidate targets of brucine, strychnine and icajine involved in Semen Strychni (Mǎ Qián Zǐ) and eight candidate targets of astragaloside-IV, formononetin and calycosin-7-glucoside involved in Astragalus (Huáng Qí) were predicted by the significant interaction patterns, in combination with docking, which were consistent with the therapeutic effects of Semen Strychni and Astragalus for cancer and chronic pain. The new strategy in this study improves the accuracy of target identification for small molecules, which will facilitate discovery of botanical drugs.

Keywords: Bayesian Gaussian mixture model; botanical drugs; drug repurposing; off-target identification; target identification.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bayes Theorem*
  • Ligands
  • Protein Binding
  • Reproducibility of Results

Substances

  • Ligands