The insightful water quality analysis and predictive model establishment via machine learning in dual-source drinking water distribution system

Environ Res. 2024 Feb 16:250:118474. doi: 10.1016/j.envres.2024.118474. Online ahead of print.

Abstract

Dual-source drinking water distribution systems (DWDS) over single-source water supply systems are becoming more practical in providing water for megacities. However, the more complex water supply problems are also generated, especially at the hydraulic junction. Herein, we have sampled for a one-year and analyzed the water quality at the hydraulic junction of a dual-source DWDS. The results show that visible changes in drinking water quality, including turbidity, pH, UV254, DOC, residual chlorine, and trihalomethanes (TMHs), are observed at the sample point between 10 and 12 km to one drinking water plant. The average concentration of residual chlorine decreases from 0.74 ± 0.05 mg/L to 0.31 ± 0.11 mg/L during the water supplied from 0 to 10 km and then increases to 0.75 ± 0.05 mg/L at the end of 22 km. Whereas the THMs shows an opposite trend, the concentration reaches to a peak level at hydraulic junction area (10-12 km). According to parallel factor (PARAFAC) and high-performance size-exclusion chromatography (HPSEC) analysis, organic matters vary significantly during water distribution, and tryptophan-like substances and amino acids are closely related to the level of THMs. The hydraulic junction area is confirmed to be located at 10-12 km based on the water quality variation. Furthermore, data-driven models are established by machine learning (ML) with test R2 higher than 0.8 for THMs prediction. And the SHAP analysis explains the model results and identifies the positive (water temperature and water supply distance) and negative (residual chlorine and pH) key factors influencing the THMs formation. This study conducts a deep understanding of water quality at the hydraulic junction areas and establishes predictive models for THMs formation in dual-sources DWDS.

Keywords: Dual-source water distribution system; Machine learning; Parallel factor and size-exclusion chromatography; Residual chlorine; THMs.