Enhancing short-term streamflow prediction in the Haihe River Basin through integrated machine learning with Lasso

Water Sci Technol. 2024 May;89(9):2367-2383. doi: 10.2166/wst.2024.142. Epub 2024 May 2.

Abstract

With the widespread application of machine learning in various fields, enhancing its accuracy in hydrological forecasting has become a focal point of interest for hydrologists. This study, set against the backdrop of the Haihe River Basin, focuses on daily-scale streamflow and explores the application of the Lasso feature selection method alongside three machine learning models (long short-term memory, LSTM; transformer for time series, TTS; random forest, RF) in short-term streamflow prediction. Through comparative experiments, we found that the Lasso method significantly enhances the model's performance, with a respective increase in the generalization capabilities of the three models by 21, 12, and 14%. Among the selected features, lagged streamflow and precipitation play dominant roles, with streamflow closest to the prediction date consistently being the most crucial feature. In comparison to the TTS and RF models, the LSTM model demonstrates superior performance and generalization capabilities in streamflow prediction for 1-7 days, making it more suitable for practical applications in hydrological forecasting in the Haihe River Basin and similar regions. Overall, this study deepens our understanding of feature selection and machine learning models in hydrology, providing valuable insights for hydrological simulations under the influence of complex human activities.

Keywords: Lasso; long short-term memory; random forest; streamflow prediction; transformer for time series.

MeSH terms

  • China
  • Forecasting / methods
  • Hydrology
  • Machine Learning*
  • Models, Theoretical
  • Rivers*
  • Water Movements