Computational prediction for the metabolism of human UDP-glucuronosyltransferase 1A1 substrates

Comput Biol Med. 2022 Oct:149:105959. doi: 10.1016/j.compbiomed.2022.105959. Epub 2022 Aug 19.

Abstract

UDP-glucuronosyltransferase (UGT) 1A1, one of the most important isoforms in UGTs superfamily, has attracted increasing concerns for its special role in the clearance and detoxification of endogenous and exogenous substances. To avoid the clinical drug-drug interactions, it is of great importance to have the knowledge of the metabolic profile of UGT1A1 substrates early. Herein, we purposed to establish machine learning models to predict the metabolic propeties of UGT1A1 substrates. On the basis of the literature-derived substrates database of UGT1A1, automatic metabolism prediction models for the aromatic hydroxyl (ArOH) and carboxyl (COOH) groups were developed with eight machine learning methods, among which, three methods, i.e. Random Forest, Random Subspace and J48, illustrated the best performance either for the aromatic hydroxyl and the carboxyl model. The models illustrated good robustness when they were evaluated with functions like "Precision", "Recall", "F-Measure", "AUC", "MCC", etc. Nice accuracy was observed for the aromatic hydroxyl and carboxyl model of these methods, whose AUCs ranged from 0.901 to 0.997. Additionally, the ArOH model was applied to predict the UGT1A1-mediated metabolism of an external set. Two new unknown substrates, cytochrome P450 (CYPs)-mediated metabolites of gefitinib, were predicted and identified, which were validated by in vitro assays. In summary, this study provides a reliable and robust strategy to predict UGT1A1 metabolites, which will be helpful either in rational-optimization of drug metabolism or in avoiding drug-drug interactions in clinic.

Keywords: Feature selection; Machine learning; Metabolism; UGT1A1.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cytochrome P-450 Enzyme System* / metabolism
  • Gefitinib
  • Glucuronosyltransferase* / metabolism
  • Humans
  • Protein Isoforms
  • Uridine Diphosphate

Substances

  • Protein Isoforms
  • Uridine Diphosphate
  • Cytochrome P-450 Enzyme System
  • UGT1A1 enzyme
  • Glucuronosyltransferase
  • Gefitinib