Application of fuzzy c-means clustering to PRTR chemicals uncovering their release and toxicity characteristics

Sci Total Environ. 2018 May 1:622-623:861-868. doi: 10.1016/j.scitotenv.2017.12.032. Epub 2017 Dec 13.

Abstract

Increasing manufacture and usage of chemicals have not been matched by the increase in our understanding of their risks. Pollutant release and transfer register (PRTR) is becoming a popular measure for collecting chemical data and enhancing the public right to know. However, these data are usually in high dimensionality which restricts their wider use. The present study partitions Japanese PRTR chemicals into five fuzzy clusters by fuzzy c-mean clustering (FCM) to explore the implicit information. Each chemical with membership degrees belongs to each cluster. Cluster I features high releases from non-listed industries and the household sector and high environmental toxicity. Cluster II is characterized by high reported releases and transfers from 24 listed industries above the threshold, mutagenicity, and high environmental toxicity. Chemicals in cluster III have characteristics of high releases from non-listed industries and low toxicity. Cluster IV is characterized by high reported releases and transfers from 24 listed industries above the threshold and extremely high environmental toxicity. Cluster V is characterized by low releases yet mutagenicity and high carcinogenicity. Chemicals with the highest membership degree were identified as representatives for each cluster. For the highest membership degree, half of the chemicals have a value higher than 0.74. If we look at both the highest and the second highest membership degrees simultaneously, about 94% of the chemicals have a value higher than 0.5. FCM can serve as an approach to uncover the implicit information of highly complex chemical dataset, which subsequently supports the strategy development for efficient and effective chemical management.

Keywords: Chemical management; Fuzzy c-means clustering; PRTR; Release and toxicity; Risk assessment.

MeSH terms

  • Algorithms
  • Cluster Analysis
  • Environmental Pollutants / toxicity*
  • Fuzzy Logic*
  • Industrial Waste
  • Industry
  • Japan
  • Risk

Substances

  • Environmental Pollutants
  • Industrial Waste