Review of machine learning-based surrogate models of groundwater contaminant modeling

Environ Res. 2023 Dec 1;238(Pt 2):117268. doi: 10.1016/j.envres.2023.117268. Epub 2023 Sep 28.

Abstract

Heavy computational load inhibits the application of groundwater contaminant numerical model to groundwater pollution source identification, remediation design, and uncertainty analysis, since a large number of model runs are required for these applications. Machine learning-based surrogate models are an effective approach to enhance the efficiency of the numerical models, and have recently attracted considerable attention in the field of groundwater contaminant modeling. Here, we review 120 research articles on machine learning-based surrogate models for groundwater contaminant modeling that were published between 1994 and 2022. We outline the state of the art method, identify the most significant research challenges, and suggest potential future directions. The six major applications of machine learning-based surrogate models are groundwater pollution source identification, groundwater remediation design, coastal aquifer management, uncertainty analysis of groundwater, groundwater monitoring network design, and groundwater transport parameters inversion. Together, these account for more than 90% of the studies we review. Latin hypercube sampling (LHS) is the most widely used sampling method, and artificial neural networks (ANNs) and Kriging are the two most widely used methods for constructing surrogate model. No method is universally superior, the advantages and disadvantages of different methods, as well as the applicability of these methods for different application purposes of groundwater contaminant modeling were analyzed. Some recommendations on the method selection for various application fields are given based on the reviews and experiences. Based on our review of the state-of-the-art, we suggest several future research directions to enhance the feasibility of the machine learning-based surrogate models of groundwater contaminant modeling: the alleviation of the curse of dimensionality, enhancing transferability, practical applications for real case studies, multi-source dada fusion, and real-time monitoring and prediction.

Keywords: Artificial neural network; Groundwater contaminant transport modeling; Machine learning; Surrogate model.

Publication types

  • Review
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Environmental Pollution
  • Groundwater*
  • Machine Learning
  • Models, Theoretical*
  • Neural Networks, Computer