OCEAN: Optimized Cross rEActivity estimatioN

J Chem Inf Model. 2016 Oct 24;56(10):2013-2023. doi: 10.1021/acs.jcim.6b00067. Epub 2016 Sep 26.

Abstract

The prediction of molecular targets is highly beneficial during the drug discovery process, be it for off-target elucidation or deconvolution of phenotypic screens. Here, we present OCEAN, a target prediction tool exclusively utilizing publically available ChEMBL data. OCEAN uses a heuristics approach based on a validation set containing almost 1000 drug ← → target relationships. New ChEMBL data (ChEMBL20 as well as ChEMBL21) released after the validation was used for a prospective OCEAN performance check. The success rates of OCEAN to predict correctly the targets within the TOP10 ranks are 77% for recently marketed drugs and 62% for all new ChEMBL20 compounds and 51% for all new ChEMBL21 compounds. OCEAN is also capable of identifying polypharmacological compounds; the success rate for molecules simultaneously hitting at least two targets is 64% to be correctly predicted within the TOP10 ranks. The source code of OCEAN can be found at http://www.github.com/rdkit/OCEAN.

MeSH terms

  • Algorithms
  • Animals
  • Databases, Pharmaceutical
  • Drug Discovery / methods*
  • Humans
  • Internet
  • Molecular Targeted Therapy
  • Polypharmacology
  • Proteins / metabolism
  • Small Molecule Libraries / chemistry
  • Small Molecule Libraries / pharmacology
  • Software*

Substances

  • Proteins
  • Small Molecule Libraries