Persistent organic pollutants (POPs) - QSPR classification models by means of Machine learning strategies

Chemosphere. 2022 Jan;287(Pt 2):132189. doi: 10.1016/j.chemosphere.2021.132189. Epub 2021 Sep 10.

Abstract

Persistent Organic pollutants (POPs) are toxic chemicals with a shallow degradation rate and global negative impact. Their physicochemical is combined with the complex effects of long-term POPs accumulation in the environment and transport function through the food chain. That is why POPs have been linked to adverse effects on human health and animals. They circulate globally via different environmental pathways, and could be detected in regions far from their source of origin. The primary goal of the present study is to carry out classification of various representatives of POPs using different theoretical descriptors (molecular, structural) to develop quantitative structure-properties relationship (QSPR) models for predicting important properties POPs. Multivariate statistical methods such as hierarchical cluster analysis, principal components analysis and self-organizing maps were applied to reach excellent partitioning of 149 representatives of POPs into 4 classes using ten most appropriate descriptors (out of 63) defined by variable reduction procedure. The predictive capabilities of the defined classes could be applied as a pattern recognition for new and unidentified POPs, based only on structural properties that similar molecules may have. The additional self-organizing maps technique made it possible to visualize the feature-space and investigate possible patterns and similarities between POPs molecules. It contributes to confirmation of the proper classification into four classes. Based on SOM results, the effect of each variable and pattern formation has been presented.

Keywords: Classification; Persistent organic pollutants; Self-organizing maps; Supervised and non supervised chemometric techniques; Training of data.

MeSH terms

  • Animals
  • Environmental Pollutants* / analysis
  • Food Chain
  • Humans
  • Machine Learning
  • Persistent Organic Pollutants*
  • Quantitative Structure-Activity Relationship

Substances

  • Environmental Pollutants