Classification of a Naïve Bayesian Fingerprint model to predict reproductive toxicity$

SAR QSAR Environ Res. 2018 Aug;29(8):631-645. doi: 10.1080/1062936X.2018.1499125. Epub 2018 Jul 31.

Abstract

Using data from the Leadscope database and Procter and Gamble researchers (1172 compounds after data curation) a new classification model to predict reproductive toxicity was developed. The model is based on Naïve Bayesian methods that use the fingerprint "extended connectivity fingerprint 2". Bits generated by the fingerprint are used from the models as descriptors to discriminate between the two classes. This technique permits the creation of a model without the use of descriptors. After a study on the probability scores, the Naïve Bayesian Fingerprint model shows a good performance on reproductive toxicity. The Matthews Correlation Coefficient value was ≥0.4 in validation. The development of new models to predict complex endpoints such as reproductive toxicity is increasingly requested, with reference also to the REACH legislation in Europe or TSCA in the USA.

Keywords: QSAR; classification; fingerprint; reproductive toxicity.

MeSH terms

  • Animals
  • Bayes Theorem
  • Mice
  • Models, Molecular
  • Quantitative Structure-Activity Relationship*
  • Rats
  • Reproduction / drug effects*
  • Toxicity Tests