CYPlebrity: Machine learning models for the prediction of inhibitors of cytochrome P450 enzymes

Bioorg Med Chem. 2021 Sep 15:46:116388. doi: 10.1016/j.bmc.2021.116388. Epub 2021 Aug 28.

Abstract

The vast majority of approved drugs are metabolized by the five major cytochrome P450 (CYP) isozymes, 1A2, 2C9, 2C19, 2D6 and 3A4. Inhibition of CYP isozymes can cause drug-drug interactions with severe pharmacological and toxicological consequences. Computational methods for the fast and reliable prediction of the inhibition of CYP isozymes by small molecules are therefore of high interest and relevance to pharmaceutical companies and a host of other industries, including the cosmetics and agrochemical industries. Today, a large number of machine learning models for predicting the inhibition of the major CYP isozymes by small molecules are available. With this work we aim to go beyond the coverage of existing models, by combining data from several major public and proprietary sources. More specifically, we used up to 18815 compounds with measured bioactivities to train random forest classification models for the individual CYP isozymes. A major advantage of the new data collection over existing ones is the better representation of the minority class, the CYP inhibitors. With the new data collection we achieved inhibitor-to-non-inhibitor ratios in the order of 1:1 (CYP1A2) to 1:3 (CYP2D6). We show that our models reach competitive performance on external data, with Matthews correlation coefficients (MCCs) ranging from 0.62 (CYP2C19) to 0.70 (CYP2D6), and areas under the receiver operating characteristic curve (AUCs) between 0.89 (CYP2C19) and 0.92 (CYPs 2D6 and 3A4). Importantly, the models show a high level of robustness, reflected in a good predictivity also for compounds that are structurally dissimilar to the compounds represented in the training data. The best models presented in this work are freely accessible for academic research via a web service.

Keywords: Cytochrome P450 inhibition; Drug metabolism; Machine learning; Metabolism prediction.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cytochrome P-450 Enzyme Inhibitors / chemical synthesis
  • Cytochrome P-450 Enzyme Inhibitors / chemistry
  • Cytochrome P-450 Enzyme Inhibitors / pharmacology*
  • Cytochrome P-450 Enzyme System / metabolism*
  • Dose-Response Relationship, Drug
  • Humans
  • Machine Learning*
  • Models, Molecular
  • Molecular Structure
  • Structure-Activity Relationship

Substances

  • Cytochrome P-450 Enzyme Inhibitors
  • Cytochrome P-450 Enzyme System