Prediction of acute toxicity of phenol derivatives using multiple linear regression approach for Tetrahymena pyriformis contaminant identification in a median-size database

Chemosphere. 2016 Dec:165:434-441. doi: 10.1016/j.chemosphere.2016.09.041. Epub 2016 Sep 30.

Abstract

In this article, the modeling of inhibitory grown activity against Tetrahymena pyriformis is described. The 0-2D Dragon descriptors based on structural aspects to gain some knowledge of factors influencing aquatic toxicity are mainly used. Besides, it is done by some enlarged data of phenol derivatives described for the first time and composed of 358 chemicals. It overcomes the previous datasets with about one hundred compounds. Moreover, the results of the model evaluation by the parameters in the training, prediction and validation give adequate results comparable with those of the previous works. The more influential descriptors included in the model are: X3A, MWC02, MWC10 and piPC03 with positive contributions to the dependent variable; and MWC09, piPC02 and TPC with negative contributions. In a next step, a median-size database of nearly 8000 phenolic compounds extracted from ChEMBL was evaluated with the quantitative-structure toxicity relationship (QSTR) model developed providing some clues (SARs) for identification of ecotoxicological compounds. The outcome of this report is very useful to screen chemical databases for finding the compounds responsible of aquatic contamination in the biomarker used in the current work.

Keywords: ChEMBL; Dragon descriptor; Multiple linear regression; Phenol; QSTR; Tetrahymena pyriformis.

MeSH terms

  • Databases, Factual
  • Linear Models
  • Models, Theoretical*
  • Phenols / chemistry
  • Phenols / toxicity*
  • Quantitative Structure-Activity Relationship
  • Tetrahymena pyriformis / drug effects*

Substances

  • Phenols