Introducing conformal prediction in predictive modeling for regulatory purposes. A transparent and flexible alternative to applicability domain determination

Regul Toxicol Pharmacol. 2015 Mar;71(2):279-84. doi: 10.1016/j.yrtph.2014.12.021. Epub 2015 Jan 2.

Abstract

Conformal prediction is presented as a framework which fulfills the OECD principles on (Q)SAR. It offers an intuitive extension to the application of machine-learning methods to structure-activity data where focus is on predictions with pre-defined confidence levels. A conformal predictor will make correct predictions on new compounds corresponding to a user defined confidence level. The confidence level can be altered depending on the situation the predictor is being used in, which allows for flexibility and adaption to risks that the user is willing to take. We demonstrate the usefulness of conformal prediction by applying it to 2 publicly available CAESAR binary classification datasets.

Keywords: Applicability domain; CAESAR; Confidence predictor; Conformal prediction; Conformity score; MOE descriptors; REACH; Random forest; Signature descriptors.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Databases, Factual*
  • Drug and Narcotic Control / legislation & jurisprudence*
  • Drug and Narcotic Control / methods
  • Forecasting
  • Models, Theoretical*
  • Molecular Conformation*
  • Quantitative Structure-Activity Relationship