Machine learning-based water quality prediction using octennial in-situ Daphnia magna biological early warning system data

Heewon Jeong; Sanghyun Park; Byeongwook Choi; Chung Seok Yu; Ji Young Hong; Tae-Yong Jeong; Kyung Hwa Cho

doi:10.1016/j.jhazmat.2023.133196

J Hazard Mater. 2024 Mar 5:465:133196. doi: 10.1016/j.jhazmat.2023.133196. Epub 2023 Dec 8.

Authors

Heewon Jeong¹, Sanghyun Park², Byeongwook Choi³, Chung Seok Yu², Ji Young Hong², Tae-Yong Jeong⁴, Kyung Hwa Cho⁵

Affiliations

¹ Department of Urban and Environmental Engineering, Ulsan National Institute of Science and Technology (UNIST), UNIST-gil 50, Ulsan 44919, Republic of Korea.
² The National Institute of Environmental Research, 42 Hwangyeong-ro, Seo-gu, Incheon 22689, Republic of Korea.
³ Department of Environmental Science, Hankuk University of Foreign Studies, Oedae-ro 81, Yongin-si, Gyeonggi-do 17035, Republic of Korea.
⁴ Department of Environmental Science, Hankuk University of Foreign Studies, Oedae-ro 81, Yongin-si, Gyeonggi-do 17035, Republic of Korea. Electronic address: tyj@hufs.ac.kr.
⁵ School of Civil, Environmental and Architectural Engineering, Korea University, Seoul 02841, Republic of Korea. Electronic address: khcho80@korea.ac.kr.

PMID: 38141299
DOI: 10.1016/j.jhazmat.2023.133196

Abstract

Biological early warning systems (BEWS) are used worldwide for surface water quality monitoring. Despite this extensive use, BEWS have limitations, including difficulties in biological interpretation and low alarm reproducibility. This study addressed these issues by applying machine learning (ML) models to eight years of in-situ BEWS data for Daphnia magna. Six ML models were used to predict contamination alarms from Daphnia behavioral parameters. Of these, the light gradient boosting machine model showed the greatest improvement in alarm prediction. Compared with the traditional BEWS alarm index, the ML model improved precision and recall by 29.50% and 43.41%, respectively. The speed distribution index and swimming speed were significant parameters for predicting water quality warnings. ML models that simulate Daphnia behavior from water contaminants identified nonlinear relationships between the monitored Daphnia behaviors and physicochemical water quality parameters (i.e., flow rate, chlorophyll-a concentration, water temperature, and conductivity). These findings suggest that ML models could provide a robust framework for improving the predictive capability of BEWS and a promising route to real-time, accurate assessment of water quality, thereby contributing to more proactive and effective water quality management.
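
The comparison above rests on precision and recall for alarm prediction. As a minimal sketch of how the two metrics are computed for binary alarm labels, the Python below scores two alarm sources against the same reference labels. The function name `precision_recall` and all label values are hypothetical and chosen only for illustration; they are not data or code from the study, and the abstract does not state whether its reported improvements are relative or absolute.

```python
from typing import Sequence, Tuple


def precision_recall(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[float, float]:
    """Return (precision, recall) for binary labels where 1 means 'alarm'."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # alarms correctly raised
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false alarms
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # missed events
    # Guard against division by zero when no alarms are predicted or present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Hypothetical toy labels (NOT from the study): 1 = contamination event / alarm.
reference = [1, 0, 0, 1, 1, 0, 0, 1]
index_alarm = [1, 1, 0, 0, 0, 1, 0, 1]  # stand-in for a threshold-based alarm index
model_alarm = [1, 0, 0, 1, 0, 1, 0, 1]  # stand-in for a classifier's predictions

print("index:", precision_recall(reference, index_alarm))
print("model:", precision_recall(reference, model_alarm))
```

Precision penalizes false alarms and recall penalizes missed events, so reporting both, as the abstract does, captures the two failure modes that matter for an early warning system.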

Keywords: Biological early warning system; Daphnia magna; Explainable models; Machine learning models; Water quality.

MeSH terms

Animals
Daphnia
Daphnia magna
Reproducibility of Results
Swimming
Water Pollutants, Chemical* / pharmacology
Water Quality*

Substances

Water Pollutants, Chemical