Deep learning-based prediction of effluent quality of a constructed wetland

Environ Sci Ecotechnol. 2022 Sep 24:13:100207. doi: 10.1016/j.ese.2022.100207. eCollection 2023 Jan.

Abstract

Data-driven approaches that make timely predictions about pollutant concentrations in the effluent of constructed wetlands are essential for improving the treatment performance of constructed wetlands. However, the effect of the meteorological condition and flow changes in a real scenario are generally neglected in water quality prediction. To address this problem, in this study, we propose an approach based on multi-source data fusion that considers the following indicators: water quality indicators, water quantity indicators, and meteorological indicators. In this study, we establish four representative methods to simultaneously predict the concentrations of three representative pollutants in the effluent of a practical large-scale constructed wetland: (1) multiple linear regression; (2) backpropagation neural network (BPNN); (3) genetic algorithm combined with the BPNN to solve the local minima problem; and (4) long short-term memory (LSTM) neural network to consider the influence of past results on the present. The results suggest that the LSTM-predicting model performed considerably better than the other deep neural network-based model or linear method, with a satisfactory R2. Additionally, given the huge fluctuation of different pollutant concentrations in the effluent, we used a moving average method to smooth the original data, which successfully improved the accuracy of traditional neural networks and hybrid neural networks. The results of this study indicate that the hybrid modeling concept that combines intelligent and scientific data preprocessing methods with deep learning algorithms is a feasible approach for forecasting water quality in the effluent of actual engineering.

Keywords: Constructed wetlands; Deep learning; LSTM; Multi-source data fusion; Water quality prediction.