Taking full advantage of 'low-quality' datasets in watershed modeling and management: From a perspective of parameter calibration

Yi Rong; Chengxin Qin; Haw Yen; Fu Sun; Pengfei Du; Siyu Zeng

doi:10.1016/j.jenvman.2023.119955

Taking full advantage of 'low-quality' datasets in watershed modeling and management: From a perspective of parameter calibration

J Environ Manage. 2024 Feb:351:119955. doi: 10.1016/j.jenvman.2023.119955. Epub 2024 Jan 1.

Authors

Yi Rong¹, Chengxin Qin², Haw Yen³, Fu Sun², Pengfei Du², Siyu Zeng⁴

Affiliations

¹ Tsinghua University, China; Chinese Academy of Environmental Planning, China.
² Tsinghua University, China.
³ Environmental Exposure Modeling, Regulatory Science North America, Bayer US Crop Science Division, Chesterfield, 63017, USA.
⁴ Tsinghua University, China. Electronic address: szeng@mail.tsinghua.edu.cn.

PMID: 38169264
DOI: 10.1016/j.jenvman.2023.119955

Abstract

The quality of calibration datasets is critical for establishing well-calibrated models for reliable decision-making support. However, the analysis of the influence of calibration dataset quality and the discussion on how to use flawed and/or incomplete datasets are still far from sufficient. An evaluation framework for the impact of model calibration data on parameter identifiability, sensitivity, and uncertainty (ISU) was established. Three quantitative and normalized indicators were designed to describe the magnitude of ISU. With the case study of the upper Daqing River watershed, China and the model SWAT (Soil and Water Assessment Tool), one ideal dataset without quality flaws and 79 datasets with different types of flaws including observation error, low monitoring frequency, short data duration and low data resolution were evaluated. The result showed that 4 of 13 parameters that control canopy, groundwater and channel processes have higher ISU values, indicating the high identifiability, high sensitivity, and low uncertainty. The largest gap of parameter ISU between dataset with quality flaw and ideal dataset was 0.61 due to short data duration, while the smallest gap was -0.28 due to low monitoring data frequency. Although some defective datasets caused unacceptable calibration results and model output, some defective datasets can still be valuable for model calibration which depends on the hydrological processes of interest when applying the model. Equivalent calibration results were yielded by the datasets with similar statistical properties. When using datasets with traditional defective issues for calibration, a new step checking the consistency among decision goal, representative system process, determinative parameters and calibration datasets is suggested. Practices including process-related data selection, dataset regrouping and risk self-reporting when using low-quality datasets are encouraged to increase the reliability of model-based watershed management.

MeSH terms

Calibration
Models, Theoretical*
Reproducibility of Results
Soil
Water Quality*

Substances

Soil