Consensus QSAR models estimating acute toxicity to aquatic organisms from different trophic levels: algae, Daphnia and fish

SAR QSAR Environ Res. 2020 Sep;31(9):655-675. doi: 10.1080/1062936X.2020.1797872. Epub 2020 Aug 17.

Abstract

We report new consensus models estimating acute toxicity for algae, Daphnia and fish endpoints. We assembled a large collection of 3680 public unique compounds annotated by, at least, one experimental value for the given endpoint. Support Vector Machine models were internally and externally validated following the OECD principles. Reasonable predictive performances were achieved (RMSEext = 0.56-0.78) which are in line with those of state-of-the-art models. The known structural alerts are compared with analysis of the atomic contributions to these models obtained using the ISIDA/ColorAtom utility. A benchmarking against existing tools has been carried out on a set of compounds considered more representative and relevant for the chemical space of the current chemical industry. Our model scored one of the best accuracy and data coverage. Nevertheless, industrial data performances were noticeably lower than those on public data, indicating that existing models fail to meet the industrial needs. Thus, final models were updated with the inclusion of new industrial compounds, extending the applicability domain and relevance for application in an industrial context. Generated models and collected public data are made freely available.

Keywords: QSAR/QSPR; REACH; acute aquatic toxicity; generative topographic mapping (GTM).

MeSH terms

  • Animals
  • Daphnia / drug effects*
  • Fishes*
  • Microalgae / drug effects*
  • Quantitative Structure-Activity Relationship*
  • Support Vector Machine
  • Toxicity Tests, Acute*
  • Water Pollutants, Chemical / toxicity*

Substances

  • Water Pollutants, Chemical