Methods for assigning confidence to toxicity data with multiple values--Identifying experimental outliers

Sci Total Environ. 2014 Jun 1:482-483:358-65. doi: 10.1016/j.scitotenv.2014.02.115. Epub 2014 Mar 21.

Abstract

The assessment of data quality is a crucial element in many disciplines such as predictive toxicology and risk assessment. Currently, the reliability of toxicity data is assessed on the basis of testing information alone (adherence to Good Laboratory Practice (GLP), detailed testing protocols, etc.). Common practice is to take one toxicity data point per compound - usually the one with the apparently highest reliability. All other toxicity data points (for the same experiment and compound) from other sources are neglected. To show the benefits of incorporating the "less reliable" data, a simple, independent, statistical approach to assess data quality and reliability on a mathematical basis was developed. A large data set of toxicity values to Aliivibrio fischeri was assessed. The data set contained 1813 data points for 1227 different compounds, including 203 identified as non-polar narcotic. Log KOW values were calculated and non-polar narcosis quantitative structure-activity relationship (QSAR) models were built. A statistical approach to data quality assessment, which is based on data outlier omission and confidence scoring, improved the linear QSARs. The results indicate that a beneficial method for using large data sets containing multiple data values per compound and highly variable study data has been developed. Furthermore this statistical approach can help to develop novel QSARs and support risk assessment by obtaining more reliable values for biological endpoints.

Keywords: Confidence; Conflicting data; Data quality; Microtox; Predictive toxicology; QSAR.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Confidence Intervals
  • Hazardous Substances / toxicity*
  • Models, Chemical
  • Quantitative Structure-Activity Relationship
  • Reproducibility of Results
  • Risk Assessment / methods
  • Statistics as Topic
  • Toxicity Tests / methods*

Substances

  • Hazardous Substances