toxCSM: comprehensive prediction of small molecule toxicity profiles

Brief Bioinform. 2022 Sep 20;23(5):bbac337. doi: 10.1093/bib/bbac337.

Abstract

Drug discovery is a lengthy, costly and high-risk endeavour that is further convoluted by high attrition rates in later development stages. Toxicity has been one of the main causes of failure during clinical trials, increasing drug development time and costs. To facilitate early identification and optimisation of toxicity profiles, several computational tools emerged aiming at improving success rates by timely pre-screening drug candidates. Despite these efforts, there is an increasing demand for platforms capable of assessing both environmental as well as human-based toxicity properties at large scale. Here, we present toxCSM, a comprehensive computational platform for the study and optimisation of toxicity profiles of small molecules. toxCSM leverages on the well-established concepts of graph-based signatures, molecular descriptors and similarity scores to develop 36 models for predicting a range of toxicity properties, which can assist in developing safer drugs and agrochemicals. toxCSM achieved an Area Under the Receiver Operating Characteristic (ROC) Curve (AUC) of up to 0.99 and Pearson's correlation coefficients of up to 0.94 on 10-fold cross-validation, with comparable performance on blind test sets, outperforming all alternative methods. toxCSM is freely available as a user-friendly web server and API at http://biosig.lab.uq.edu.au/toxcsm.

Keywords: graph-based signatures; machine learning; toxCSM; toxicity; toxicity predictions.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Agrochemicals*
  • Drug Discovery* / methods
  • Humans
  • ROC Curve

Substances

  • Agrochemicals