Using an ensemble model coupled with portable X-ray fluorescence and visible near-infrared spectroscopy to explore the viability of mapping and estimating arsenic in an agricultural soil

Sci Total Environ. 2022 Apr 20:818:151805. doi: 10.1016/j.scitotenv.2021.151805. Epub 2021 Nov 20.

Abstract

Increasing concentrations of potentially toxic elements (PTE) in agricultural soils remain a major source of public concern. Monitoring PTEs in an agricultural field with no history of contaminants necessitate adequate analysis utilizing a robust model to accurately uncover hidden PTEs. Detecting and mapping the distribution of soil properties using portable X-ray fluorescence (pXRF) and proximal sensing techniques is not only rapid, but also relatively inexpensive. In this study, an ensemble model, consisting of partial least square regression (PLSR), support vector machine (SVM), random forest (RF) and cubist, was used for the prediction and mapping of soil As content in an agricultural field with no history of pollution. The datasets were collected using pXRF and field spectroscopy techniques. The main goal was to compare the ensemble model to each of the calibration techniques in terms of prediction accuracy of As content in such a field. Other components [e.g., soil organic carbon (SOC), Mn, S, soil pH, Fe] that are known to influence As levels in the soil were also retrieved to assess their correlation with soil As. The models were evaluated using the root mean squared error (RMSECV), the coefficient of determination (R2CV) and the ratio of performance to interquartile range (RPIQ). In terms of prediction accuracy, the ensemble model outperformed each of the individual techniques (R2CV = 0.80/0.75) and obtained the least error margin (RMSECV = 1.91/2.16). Overall, all the predictive techniques were able to detect both low and high estimated values of soil As within the study field, but with the ensemble model resembling the measurements better. The ensemble model, a promising tool as demonstrated by the current study, is highly recommended to be included in future studies for more accurate estimation of As and other PTEs in other agricultural fields.

Keywords: Agricultural soil; Arsenic; Ensemble model; Field spectroscopy; Machine learning; Portable X-ray fluorescence (pXRF).

MeSH terms

  • Arsenic* / analysis
  • Carbon
  • Soil / chemistry
  • Soil Pollutants* / analysis
  • Spectroscopy, Near-Infrared
  • X-Rays

Substances

  • Soil
  • Soil Pollutants
  • Carbon
  • Arsenic