Mapping specific groundwater nitrate concentrations from spatial data using machine learning: A case study of Chongqing, China

Heliyon. 2024 Mar 13;10(6):e27867. doi: 10.1016/j.heliyon.2024.e27867. eCollection 2024 Mar 30.

Abstract

Groundwater is not only an essential water resource but also an important connector within the intricate framework of the ecological environment. High nitrate concentrations in groundwater can adversely affect human health, so it is imperative to accurately delineate the distribution characteristics of groundwater nitrate concentrations. Four machine learning models (Gradient Boosting Regression (GB), Random Forest Regression (RF), Extreme Gradient Boosting Regression (XG), and Adaptive Boosting Regression (AD)), which combine spatial environmental data with contributing areas of different radii, were developed to predict the distribution of nitrate concentrations in groundwater. The models used 595 groundwater samples and included topographic, remote sensing, hydrogeological and hydrological, climatic, nitrate input, and socio-economic predictors. The Gradient Boosting Regression model outperformed the other models (R² = 0.627, MAE = 0.529, RMSE = 0.705, PICP = 0.924 for the test dataset) with a 500 m radius contributing area. A high-resolution (1 km) groundwater nitrate concentration map reveals that in the majority of the study area, groundwater nitrate concentrations are below 1 mg/L, and that the proportion of high nitrate concentrations (>10 mg/L) in the southeast, northeast, and central main urban area karst valley regions is 1.89%, 0.91%, and 0.38%, respectively. In the study area, hydrogeological conditions, soil parameters, nitrogen input factors, and the percentage of arable land are among the most influential explanatory factors. As the first application of effective spatial methods for predicting groundwater nitrate concentrations in Chongqing city, this work provides decision-making support for the prevention and control of groundwater pollution, particularly in areas that depend primarily on groundwater for water supply.

Keywords: GIS; Groundwater nitrate contamination; Machine learning models; Uncertainty assessment.
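
The abstract reports four evaluation metrics for the test dataset: R², MAE, RMSE, and PICP (prediction interval coverage probability, the fraction of observations that fall inside their prediction intervals). The following is a minimal sketch of the standard definitions of these metrics, not the authors' code. The function names and the sample values are hypothetical, and the abstract does not state how the prediction intervals were constructed, so the interval bounds here are simply supplied as inputs.

```python
import math

def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def rmse(y_true, y_pred):
    """Root mean squared error."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))

def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_t = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_t) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

def picp(y_true, lower, upper):
    """Fraction of observations inside [lower, upper] (bounds inclusive)."""
    inside = sum(1 for t, lo, hi in zip(y_true, lower, upper) if lo <= t <= hi)
    return inside / len(y_true)

# Hypothetical toy values, not data from the study.
y_true = [1.0, 2.0, 3.0, 4.0]
y_pred = [1.5, 2.0, 2.5, 4.0]
lower = [1.1, 1.6, 2.1, 3.6]   # assumed lower interval bounds
upper = [1.9, 2.4, 2.9, 4.4]   # assumed upper interval bounds

print("MAE :", mae(y_true, y_pred))
print("RMSE:", rmse(y_true, y_pred))
print("R2  :", r2(y_true, y_pred))
print("PICP:", picp(y_true, lower, upper))
```

Under these definitions, a PICP of 0.924 means that about 92.4% of the test observations lie within their prediction intervals.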