Applications of various data-driven models for the prediction of groundwater quality index in the Akot basin, Maharashtra, India

Environ Sci Pollut Res Int. 2022 Mar;29(12):17591-17605. doi: 10.1007/s11356-021-17064-7. Epub 2021 Oct 20.

Abstract

Data-driven models are important to predict groundwater quality which is controlling human health. The water quality index (WQI) has been developed based on the physicochemical parameters of water samples. In this area, water quality is medium to poor and is found in saline zones; very high pH ranges are directly affected on the water quality in this study area. Conventional WQI computation demands more time and is often observed with enormous errors during the calculation of sub-indices. In the present work, four standalone methods such as additive regression (AR), M5P tree model (M5P), random subspace (RSS), and support vector machine (SVM) were employed to predict WQI based on variable elimination technique. The groundwater samples were collected from the Akot basin area, located in the Akola district, Maharashtra, in India. A total of nine different input combinations were developed in this study. The datasets were demarcated into two classes (ratio 80:20) for model construction (training dataset) and model verification (testing dataset) using a fivefold cross-validation approach. The models were assessed using statistical and graphical appraisal metrics. The best input combinations varied among the model, generally, the optimal input variables (EC, pH, TDS, Ca, Mg, and Cl) during the training and validation stages. Results show that AR outperformed the other data-driven models (R2 = 0.9993, MAE = 0.5243, RMSE = 0.0.6356, %RAE = 3.8449, and RRSE% = 3.9925). The AR is proposed as an ideal model with satisfactory results due to enhanced prediction precision with the minimum number of input parameters and can thus act as the reliable and precise method in the prediction of WQI at the Akot basin.

Keywords: Additive regression; Data-driven models; Groundwater; Water quality index.

MeSH terms

  • Environmental Monitoring / methods
  • Groundwater*
  • Humans
  • India
  • Water Pollutants, Chemical* / analysis
  • Water Quality

Substances

  • Water Pollutants, Chemical