A Robust Machine Learning Framework Built Upon Molecular Representations Predicts CYP450 Inhibition: Toward Precision in Drug Repurposing

OMICS. 2023 Jul;27(7):305-314. doi: 10.1089/omi.2023.0075. Epub 2023 Jul 4.

Abstract

Human cytochrome P450 (CYP450) enzymes play a crucial role in drug metabolism and pharmacokinetics. CYP450 inhibition can lead to toxicity, in particular when drugs are co-administered with other drugs and xenobiotics or in the case of polypharmacy. Predicting CYP450 inhibition is also important for rational drug discovery and development, and precision in drug repurposing. In this overarching context, digital transformation of drug discovery and development, for example, using machine and deep learning approaches, offers prospects for prediction of CYP450 inhibition through computational models. We report here the development of a majority-voting machine learning framework to classify inhibitors and noninhibitors for seven major human liver CYP450 isoforms (CYP1A2, CYP2A6, CYP2B6, CYP2C9, CYP2C19, CYP2D6, and CYP3A4). For the machine learning models reported herein, we employed interaction fingerprints that were derived from molecular docking simulations, thus adding an additional layer of information for protein-ligand interactions. The proposed machine learning framework is based on the structure of the binding site of isoforms to produce predictions beyond previously reported approaches. Also, we carried out a comparative analysis so as to identify which representation of test compounds (molecular descriptors, molecular fingerprints, or protein-ligand interaction fingerprints) affects the predictive performance of the models. This work underlines the ways in which the structure of the enzyme catalytic site influences machine learning predictions and the need for robust frameworks toward better-informed predictions.

Keywords: ADME-Tox; cytochrome P450; drug repurposing; machine learning; predictive computational models; quantitative structure-activity relationships.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cytochrome P-450 Enzyme System* / metabolism
  • Drug Repositioning*
  • Humans
  • Ligands
  • Machine Learning
  • Molecular Docking Simulation
  • Protein Isoforms / metabolism

Substances

  • Ligands
  • Cytochrome P-450 Enzyme System
  • Protein Isoforms