Comparison of Bayesian, k-Nearest Neighbor and Gaussian process regression methods for quantifying uncertainty of suspended sediment concentration prediction

Sci Total Environ. 2022 Apr 20:818:151760. doi: 10.1016/j.scitotenv.2021.151760. Epub 2021 Nov 18.

Abstract

Suspended sediment transport in river system is a complex process influenced by many factors that their interactions lead to nonlinear and high scatter of concentration-discharge relationships. This makes the model prediction subject to high uncertainty and providing one value as the model prediction is somehow useless and cannot provide adequate information about the model accuracy and associated uncertainty. Current study compares the efficiency of Bayesian (i.e. Bayesian segmented linear regression (BSLR) and Bayesian linear model (BLR)), Gaussian Process Regression (GPR) and k-Nearest Neighbor (k-NN) in quantifying uncertainty of the suspended sediment concentration prediction in three watersheds namely Arazkoseh, Oghan and Jajrood located in Iran. Three input combinations including, contemporary discharge, slow and quick flow components and contemporary, one and two antecedent days discharge, were used. The BSLR model was able to identify threshold value, furthermore, pre-threshold and post-threshold slopes of BSLR model indicated that for Arazkoseh watershed channel and for Oghan and Jajrood watersheds, upland area are dominate sediment sources. In all three studied cases, given prediction interval width and the percent of enclosed observed data by prediction interval, k-NN model provided more reliable prediction interval. Moreover, separation stream flow into slow and quick flow components lead to improved performance of GPR and k-NN models in the studied watersheds, and the best results for Arazkoseh and Oghan watersheds were obtained when slow and quick flow components were used as the model input.

Keywords: Flow components; Sediment concentration; Segmented regression; Uncertainty estimation.

MeSH terms

  • Bayes Theorem
  • Environmental Monitoring* / methods
  • Geologic Sediments*
  • Rivers
  • Uncertainty