Predicting the Distribution of Arsenic in Groundwater by a Geospatial Machine Learning Technique in the Two Most Affected Districts of Assam, India: The Public Health Implications

Geohealth. 2022 Mar 1;6(3):e2021GH000585. doi: 10.1029/2021GH000585. eCollection 2022 Mar.

Abstract

Arsenic (As) is a well-known carcinogen and chemical contaminant in groundwater. The spatial heterogeneity in As distribution in groundwater makes it difficult to predict the location of safe areas for tube well installations, consumption, and agriculture. Geospatial machine learning techniques have been used to predict the location of safe and unsafe areas of groundwater As. We used a similar machine learning technique and developed a habitation-level (spatial resolution 250 m) predictive model to determine the risk and extent of As >10 μg/L in groundwater in the two most affected districts of Assam, India, with an aim to advise policymakers on targeted interventions. A random forest model was employed in Python environments to predict the probabilities of As at concentrations >10 μg/L using intrinsic and extrinsic predictor variables, which were selected for their inherent relationship with As occurrence in groundwater. The relationships between predictor variables and proportions of As occurrences >10 μg/L follow the well-documented processes leading to As release in groundwater. We identified potential As hotspots based on a probability of ≥0.7 for As >10 μg/L, including regions not previously surveyed and extending beyond previously known As hotspots. Of the total land area (6,500 km2), 25% was identified as a high-risk zone, with an estimated 155,000 people potentially consuming As through drinking water or cooking food. The ternary hazard probability map (showing high, moderate, and low risk for As >10 μg/L) could inform policymakers on establishing newer drinking water treatment plants and providing safe drinking water connections to rural households.

Keywords: Assam; arsenic; machine learning; predictive modeling; random forest model.