Identification of intrinsic hepatotoxic compounds in Polygonum multiflorum Thunb. using machine-learning methods

J Ethnopharmacol. 2022 Nov 15:298:115620. doi: 10.1016/j.jep.2022.115620. Epub 2022 Aug 10.

Abstract

Ethnopharmacological relevance: Polygonum multiflorum Thunb. (PM) is a herb, extracts of which have been used as Chinese medicine for years. Although it is believed to be beneficial to the liver, heart, and kidneys, it causes idiosyncratic drug-induced liver injury (DILI).

Aim of the study: We propose that the intrinsic DILI caused by natural products in PM (NPPM) is an important complementary mechanism to PM-related herb-induced liver injury, and aim to identify the ingredients with high DILI potential by machine learning methods.

Materials and methods: One hundred and ninety-seven NPPM were collected from the literature to identify the intrinsic hepatotoxic compounds. Additionally, a DILI-labeled dataset consisting of 2384 compounds was collected and randomly split into training and test sets. A diparametric optimization method was developed to tune the parameters of extended-connectivity fingerprints (ECFPs), Rdkit, and atom-pair fingerprints as well as those of machine-learning (ML) algorithms. Subsequently, K means were employed to cluster the NPPM that were predicted to have a high DILI risk. An in vitro cell-viability assay was performed using HepaRG cells to validate the prediction results.

Results: ECFPs with the top 35% of features ranked by the F-value with support vector machine (SVM) yielded the best performance. The optimized SVM model achieved an accuracy of 0.761 and recall value of 0.834 on the test dataset. The silico screening for NPPM resulted in 47 ingredients with high DILI potential, which were clustered into six groups based on the elbow method. A representative subgroup that contained 21 ingredients, of which two dianthrones exhibited the lowest IC50 value (0.7-0.9 μM) and anthraquinones showed moderate toxicity (15-25 μM), was constructed.

Conclusion: Using ML methods and in vitro screening, two classes of compounds, dianthrones and anthraquinones, were predicted and validated to have a high risk of DILI. The diparametric optimization method used in this study could provide a useful and powerful tool to screen toxicants for large datasets and is available at https://github.com/dreadlesss/Hepatotoxicity_predictor.

Keywords: Dianthrones; Drug-induced liver injury; K means; Machine learning; Polygonum multiflorum Thunb..

MeSH terms

  • Anthraquinones
  • Biological Products*
  • Chemical and Drug Induced Liver Injury* / etiology
  • Drug-Related Side Effects and Adverse Reactions*
  • Fallopia multiflora*
  • Machine Learning
  • Polygonum*

Substances

  • Anthraquinones
  • Biological Products