Optimal zinc level and uncertainty quantification in agricultural soils via visible near-infrared reflectance and soil chemical properties

J Environ Manage. 2023 Jan 15;326(Pt A):116701. doi: 10.1016/j.jenvman.2022.116701. Epub 2022 Nov 14.

Abstract

Zinc (Zn) is a vital element required by all living creatures for optimal health and ecosystem functioning. Therefore, several researchers have modeled and mapped its occurrence and distribution in soils. Nonetheless, leveraging model predictive performances while coupling information derived from visible near-infrared (Vis-NIR) and soils (i.e. chemical properties) to estimate potential toxic elements (PTEs) like Zn in agricultural soils is largely untapped. This study applies two methods to rapidly monitor Zn concentration in agricultural soil. Firstly, employing Vis-NIR and machine learning algorithms (MLAs) (Context 1) and secondly, applying Vis-NIR, soil chemical properties (SCP), and MLAs (Context 2). For the Vis-NIR information, single and combined pretreatment methods were applied. The following MLAs were used: conditional inference forest (CIF), partial least squares regression (PLSR), M5 tree model (M5), extreme gradient boosting (EGB), and support vector machine regression (SVMR) respectively. For context 1, the results indicated that M5-MSC (M5 tree model-multiplicative scatter correction) with coefficient of determination (R2) = 0.72, root mean square error (RMSE) = 21.08 (mg/kg), median absolute error (MdAE) = 13.69 and ratio of performance to interquartile range (RPIQ) = 1.63 was promising. Regarding context 2, CIF with spectral pretreatment and soil properties [CIF-DWTLOGMSC + SCP (conditional inference forest-discrete wavelet transformation-logarithmic transformation-multiplicative scatter correction-soil chemical properties)] yielded the best performance of R2 = 0.86, RMSE = 14.52 (mg/kg), MdAE = 6.25 and RPIQ = 1.78. Altogether, for contexts 1 and 2, the CIF-DWTLOGMSC + SCP approach (context 2) was the best Zn model outcome for the agricultural soil. The uncertainty map revealed a low to high error distribution in context 1, and a low to moderate distribution in context 2 for all models except CIF, which had some patches with high uncertainty. We conclude that a multiple optimization approach for modeling Zn levels in agricultural soils is invaluable and may provide fast and reliable information needed for area-specific decision-making.

Keywords: Agricultural soil; Conditional inference forest; Spectral reflectance; Uncertainty assessment; Zinc.

MeSH terms

  • Agriculture
  • Ecosystem*
  • Soil*
  • Uncertainty
  • Zinc

Substances

  • Soil
  • Zinc