Linking multi-media modeling with machine learning to assess and predict lake Chlorophyll a concentrations

J Great Lakes Res. 2021 Dec 13;47(6):1656-1670. doi: 10.1016/j.jglr.2021.09.011.

Abstract

Eutrophication and excessive algal growth pose a threat to aquatic organisms and to public health, the environment, and the economy. Understanding what drives excessive algal growth can inform mitigation measures and aid in advance planning to minimize impacts. We demonstrate how simulated data from weather, hydrological, and agroecosystem numerical prediction models can be combined with machine learning (ML) to assess and predict Chlorophyll a (Chl a) concentrations, a proxy for lake eutrophication and algal biomass. The study area is Lake Erie over a 16-year period, 2002-2017. A total of 20 environmental variables from linked and coupled physical models are used as input features to train the ML model with Chl a observations from 16 measuring stations. These include meteorological variables from the Weather Research and Forecasting (WRF) model, hydrological variables from the Variable Infiltration Capacity (VIC) model, and agricultural management practice variables from the Environmental Policy Integrated Climate (EPIC) agroecosystem model. Combining these variables enables successful prediction of Chl a. Aside from the synergistic effects that weather, hydrology, and fertilizers have on eutrophication and excessive algal growth, we found that the application of different forms of both phosphorus (P) and nitrogen (N) fertilizers ranks highly among the input features for predicting Chl a concentration. The developed ML model predicts Chl a with a coefficient of determination of 0.81, a bias of -0.12 μg/l, and a root mean square error (RMSE) of 4.97 μg/l. The developed ML-based modeling approach can be used to assess the impact of agricultural practices in a changing climate on Chl a concentrations in Lake Erie.
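
The three skill scores quoted above can be illustrated with a minimal Python sketch. It is not the authors' evaluation code. It assumes bias is the mean of predicted minus observed values and that the coefficient of determination is computed as 1 - SS_res/SS_tot. The abstract does not state either definition. The function name evaluate and the sample values are hypothetical.

```python
import numpy as np

def evaluate(obs, pred):
    """Return R^2, bias, and RMSE for paired observed and predicted values.

    Assumptions (not stated in the abstract): bias = mean(pred - obs),
    and R^2 = 1 - SS_res / SS_tot.
    """
    obs = np.asarray(obs, dtype=float)
    pred = np.asarray(pred, dtype=float)
    resid = pred - obs                        # prediction errors
    bias = resid.mean()                       # mean error, same units as Chl a
    rmse = np.sqrt(np.mean(resid ** 2))       # root mean square error
    ss_res = np.sum(resid ** 2)               # residual sum of squares
    ss_tot = np.sum((obs - obs.mean()) ** 2)  # total sum of squares
    r2 = 1.0 - ss_res / ss_tot
    return {"r2": r2, "bias": bias, "rmse": rmse}

# Made-up values for illustration only, not data from the study.
obs_chl = [1.0, 2.0, 3.0, 4.0]
pred_chl = [1.5, 2.5, 3.5, 4.5]
scores = evaluate(obs_chl, pred_chl)
print(scores)
```

A negative bias under this convention, such as the reported -0.12 μg/l, would mean the model slightly underpredicts Chl a on average.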

Keywords: Fertilizers; Lake eutrophication; Machine learning; Numerical prediction models.