Water quality modelling using principal component analysis and artificial neural network

Mar Pollut Bull. 2023 Feb:187:114493. doi: 10.1016/j.marpolbul.2022.114493. Epub 2022 Dec 23.

Abstract

The study investigates the latent pollution sources and most significant parameters that cause spatial variation and develops the best input for water quality modelling using principal component analysis (PCA) and artificial neural network (ANN). The dataset, 22 water quality parameters were obtained from Department of Environment Malaysia (DOE). The PCA generated six significant principal component scores (PCs) which explained 65.40 % of the total variance. Parameters for water quality variation are mainlyrelated to mineral components, anthropogenic activities, and natural processes. However, in ANN three input combination models (ANN A, B, and C) were developed to identify the best model that can predict water quality index (WQI) with very high precision. ANN A model appears to have the best prediction capacity with a coefficient of determination (R2) = 0.9999 and root mean square error (RMSE) = 0.0537. These results proved that the PCA and ANN methods can be applied as tools for decision-making and problem-solving for better managing of river quality.

Keywords: Artificial neural network; Pattern recognition; Principal component analysis; Varimax rotation; Water pollution; Water quality index.

MeSH terms

  • Environmental Monitoring* / methods
  • Neural Networks, Computer
  • Principal Component Analysis
  • Rivers
  • Water Quality*