Prediction of Dichloroethene Concentration in the Groundwater of a Contaminated Site Using XGBoost and LSTM

Int J Environ Res Public Health. 2022 Jul 30;19(15):9374. doi: 10.3390/ijerph19159374.

Abstract

Chlorinated aliphatic hydrocarbons (CAHs) are widely used in agriculture and industries and have become one of the most common groundwater contaminations. With the excellent performance of the deep learning method in predicting, LSTM and XGBoost were used to forecast dichloroethene (DCE) concentrations in a pesticide-contaminated site undergoing natural attenuation. The input variables included BTEX, vinyl chloride (VC), and five water quality indicators. In this study, the predictive performances of long short-term memory (LSTM) and extreme gradient boosting (XGBoost) were compared, and the influences of variables on models' performances were evaluated. The results indicated XGBoost was more likely to capture DCE variation and was robust in high values, while the LSTM model presented better accuracy for all wells. The well with higher DCE concentrations would lower the model's accuracy, and its influence was more evident in XGBoost than LSTM. The explanation of the SHapley Additive exPlanations (SHAP) value of each variable indicated high consistency with the rules of biodegradation in the real environment. LSTM and XGBoost could predict DCE concentrations through only using water quality variables, and LSTM performed better than XGBoost.

Keywords: LSTM; SHapley Additive exPlanations; XGBoost; contaminated site; dichloroethene; groundwater; machine learning; natural attenuation.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biodegradation, Environmental
  • Groundwater*
  • Hydrocarbons, Chlorinated*
  • Vinyl Chloride* / metabolism

Substances

  • Hydrocarbons, Chlorinated
  • Vinyl Chloride

Grants and funding

This research was sponsored by the National Key Research and Development Program of China (No. 2018YFC1800202), National Natural Science Foundation of China (No. 41977139), Special Fund of Chinese Central Government for Basic Scientific Research Operations in Commonweal Research Institute (No. GYZX220101), Special Fund of Chinese Central Government for Basic Scientific Research Operations in Commonweal Research Institute (No. GYZX220303), and Jiangsu Innovative and Entrepreneurial Talent Programme (JSSCBS20211318).