Identification of active or inactive agonists of tumor suppressor protein based on Tox21 library

Toxicology. 2022 May 30:474:153224. doi: 10.1016/j.tox.2022.153224. Epub 2022 Jun 1.

Abstract

Exposure of cells to xenobiotic human-made products can lead to genotoxicity and cause DNA damage. It is an urgent need to quickly identify the chemicals that cause DNA damage, and their toxicity should be predicted. In this study, recursive partitioning (RP), binary logistic regression, and one machine learning approach, namely, random forest (RF) classifier, were used to predict the active and inactive compounds of a total 5036 data based on the assay conducted by a β-lactamase reporter gene under control of the p53 response element (p53RE) from Tox21 library. Results show that the binary logistic regression model with a threshold of 0.5 has a high accuracy rate (83%) to distinguish active and inactive compounds. The RF classifier method has satisfactory results, with an accuracy rate (84.38%) approximately higher than that of binary logistic regression. The models established can identify compounds that induce DNA damage and activate p53, and provide a scientific basis for the risk assessment of organic chemicals in the environment.

Keywords: Binary logistic regression; P53-bla assay; RF classifier; Tox21.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biological Assay
  • DNA Damage*
  • Genes, Reporter
  • Humans
  • Logistic Models
  • Tumor Suppressor Protein p53* / agonists

Substances

  • Tumor Suppressor Protein p53