Three-dimensional augmentation for hyperspectral image data of water quality: An Integrated approach using machine learning and numerical models

Water Res. 2024 Mar 1:251:121125. doi: 10.1016/j.watres.2024.121125. Epub 2024 Jan 9.

Abstract

This research introduces a comprehensive methodology to enhance hyperspectral image data (HSD) utility, specifically focusing on the three-dimensional (3-D) augmentation of Chlorophyll-a (Chl-a). This study comprises three significant steps: (1) the augmentation of limited field water quality data in terms of time interval and number of variables using neural network models, (2) the generation of 3-D data using numerical models, and (3) the extension of the hyperspectral image data into 3-D data using machine learning models. In the first phase, Multilayer Perceptron (MLP) models were developed to train water quality interactions and successfully generated high-frequency water quality data by adjusting biased measurements and predicting detailed water quality variables. In the second phase, high-frequency data generated by MLP models were applied to develop two numerical models. These numerical models successfully generated 3-D data, thereby demonstrating the effectiveness of integrating numerical modeling with neural networks. In the final phase, ten machine learning models were trained to generate 3-D Chl-a data from HSD. Notably, the Gaussian Process Regression model exhibited superior performance, effectively estimating 3-D Chl-a data with robust accuracy, as evidenced by an R-square value of 0.99. The findings align with theories of algal bloom dynamics, further validating the effectiveness of the approach. This study demonstrated the successful integrated development for HSD extension using machine learning models, numerical models, and original HSD, highlighting the potential of such integrated methodologies in advancing water quality monitoring and estimation. Notably, the approach leverages readily accessible data, allowing for the swift generation of results and bypassing time-consuming data collection processes. This research marks a significant step towards more robust, comprehensive water quality monitoring and prediction, thereby facilitating better management of aquatic ecosystems.

Keywords: 3-D extension; Algal blooms; Hyperspectral image data; Machine learning method; Numerical model.

MeSH terms

  • Chlorophyll A
  • Ecosystem*
  • Environmental Monitoring / methods
  • Machine Learning
  • Neural Networks, Computer
  • Water Quality*

Substances

  • Chlorophyll A