Using data mining techniques to isolate chemical intrusion in water distribution systems

Environ Monit Assess. 2022 Feb 19;194(3):203. doi: 10.1007/s10661-022-09867-z.

Abstract

The security of water distribution systems has become the subject of an increasing volume of research over the last decade. Data analysis and machine learning are linked to hydraulic and quality modeling for improving the capacity of water utilities to save lives when faced with the contamination of water networks. This research applies k-nearest neighbor and random forest algorithms to estimate the location of contamination sources at near-real time. Epanet and Epanet-MSX software are used to simulate intrusions of pesticide into water distribution system and the interaction with compounds already present in water bulk. Different pesticide concentrations are considered in the simulations, and chlorine monitoring occurs through placed quality sensors. The results show that random forest can localize [Formula: see text] of contamination scenarios, while the KNN algorithm found [Formula: see text]. Finally, an assessment of contamination spread is made for a better understanding of the impacts of non-localized contamination.

Keywords: Contamination sources location; K-nearest neighbor algorithm; Random forest algorithm; Water-supply.

MeSH terms

  • Data Mining
  • Environmental Monitoring / methods
  • Water Quality
  • Water Supply*
  • Water*

Substances

  • Water