Groundwater Salinity Across India: Predicting Occurrences and Controls by Field-Observations and Machine Learning Modeling

Environ Sci Technol. 2024 Feb 27;58(8):3953-3965. doi: 10.1021/acs.est.3c06525. Epub 2024 Feb 15.

Abstract

Elevated groundwater salinity is unsuitable for drinking and harmful to crop production. Thus, it is crucial to determine groundwater salinity distribution, especially where drinking and agricultural water requirements are largely supported by groundwater. This study used field observation (n = 20,994)-based machine learning models to determine the probabilistic distribution of elevated groundwater salinity (electrical conductivity as a proxy, >2000 μS/cm) at 1 km2 across parts of India for near groundwater-table conditions. The final predictions were made by using the best-performing random forest model. The validation performance also demonstrated the robustness of the model (with 77% accuracy). About 29% of the study area (including 25% of entire cropland areas) was estimated to have elevated salinity, dominantly in northwestern and peninsular India. Also, parts of the northwestern and southeastern coasts, adjoining the Arabian Sea and the Bay of Bengal, were assessed with elevated salinity. The climate was delineated as the dominant factor influencing groundwater salinity occurrence, followed by distance from the coast, geology (lithology), and depth of groundwater. Consequently, ∼330 million people, including ∼109 million coastal populations, were estimated to be potentially exposed to elevated groundwater salinity through groundwater-sourced drinking water, thus substantially limiting clean water access.

Keywords: climate; groundwater salinity; machine learning; population exposure; random forest.

MeSH terms

  • Drinking Water*
  • Environmental Monitoring
  • Groundwater*
  • Humans
  • India
  • Salinity
  • Water Pollutants, Chemical* / analysis

Substances

  • Drinking Water
  • Water Pollutants, Chemical