High-resolution digital mapping of soil organic carbon and soil total nitrogen using DEM derivatives, Sentinel-1 and Sentinel-2 data based on machine learning algorithms

Sci Total Environ. 2020 Aug 10:729:138244. doi: 10.1016/j.scitotenv.2020.138244. Epub 2020 Apr 13.

Abstract

Soil organic carbon (SOC) and soil total nitrogen (STN) are important indicators of soil health and play a key role in the global carbon and nitrogen cycles. High-resolution Sentinel-1 radar and Sentinel-2 multispectral images can capture the spatial distribution of soil properties over large areas, yet the two data sources have rarely been combined to map SOC or STN content. In this study, we applied machine learning techniques to map SOC and STN content in the southern part of Central Europe using digital elevation model (DEM) derivatives and multi-temporal Sentinel-1 and Sentinel-2 data, and we evaluated the potential of each sensor to predict SOC and STN content. Four machine learning algorithms, namely random forest (RF), boosted regression trees (BRT), support vector machine (SVM) and Bagged CART, were used to build predictive models of SOC and STN content from 179 soil samples and different combinations of environmental covariates. Model performance was assessed by 10-fold cross-validation using three statistical indicators. Overall, the BRT model performed better than RF, SVM and Bagged CART, and all four models yielded similar spatial distribution patterns of SOC and STN. Combining data from multiple sensors gave more accurate predictions of SOC and STN content than using either sensor alone. Both Sentinel-1 radar and Sentinel-2 multispectral images proved useful for predicting SOC and STN, and combining Sentinel-1/2-derived predictors with DEM derivatives yielded the highest prediction accuracy. Adding the Sentinel-1/2-derived predictors increased the R² of the BRT model by 12.8% for SOC and 18.8% for STN. Topographic variables were the main explanatory variables for SOC and STN predictions, with elevation ranked as the most important variable by the models. These results illustrate the potential of freely available high-resolution Sentinel-1 radar and Sentinel-2 multispectral data as inputs for SOC and STN prediction models.
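
The evaluation scheme described in the abstract, 10-fold cross-validation scored by statistical indicators, can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration and is not the authors' code: the single-covariate least-squares line stands in for the four learners, R² and RMSE are assumed indicators (the abstract does not name its three), and the elevation and SOC values are synthetic. The abstract also does not say whether the reported R² gains are relative or absolute, so `relative_increase_percent` shows only one possible reading.

```python
import math
import random


def k_fold_indices(n_samples, k=10, seed=0):
    """Shuffle sample indices and split them into k nearly equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


def fit_line(x, y):
    """Ordinary least squares for y = a + b * x (stand-in for a real learner)."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def cross_validate(x, y, k=10, seed=0):
    """Return (R2, RMSE) computed from pooled out-of-fold predictions."""
    folds = k_fold_indices(len(x), k, seed)
    predictions = [0.0] * len(x)
    for held_out in folds:
        held = set(held_out)
        train = [i for i in range(len(x)) if i not in held]
        intercept, slope = fit_line([x[i] for i in train], [y[i] for i in train])
        # Predict only the samples that the model has not seen.
        for i in held_out:
            predictions[i] = intercept + slope * x[i]
    mean_y = sum(y) / len(y)
    ss_res = sum((yi - pi) ** 2 for yi, pi in zip(y, predictions))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return 1.0 - ss_res / ss_tot, math.sqrt(ss_res / len(y))


def relative_increase_percent(baseline, improved):
    """Relative change of a score with respect to its baseline, in percent."""
    return 100.0 * (improved - baseline) / baseline


# Synthetic stand-in data: 179 samples, matching only the sample count in the abstract.
rng = random.Random(42)
elevation = [rng.uniform(100.0, 1500.0) for _ in range(179)]
soc = [1.0 + 0.002 * e + rng.gauss(0.0, 0.3) for e in elevation]

r2, rmse = cross_validate(elevation, soc, k=10)
print(f"10-fold CV: R2 = {r2:.3f}, RMSE = {rmse:.3f}")
```

Pooling the out-of-fold predictions before scoring is one common convention; averaging per-fold scores is another, and the abstract does not state which was used.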

Keywords: Digital soil mapping; Machine learning; Sentinel-1; Sentinel-2; Soil organic carbon; Soil total nitrogen.