Drivers of harmful algal blooms in coastal areas of Eastern Mediterranean: a machine learning methodological approach

Math Biosci Eng. 2021 Jul 28;18(5):6484-6505. doi: 10.3934/mbe.2021322.

Abstract

Harmful algal species are present in the Mediterranean Sea and are often associated with toxic events affecting the nearby coastal zones. The presence of 18 marine microalgae, at genus level, associated with potentially harmful characteristics was predicted using a number of machine learning techniques based exclusively on a small set of abiotic variables, already identified as drivers of blooms. Random Forest (RF) algorithm achieved the best predictive performance by correctly identifying the presence of most genera with a mean of 89.2% of total samples. Although, RF has shown lower predictive performance for genera present in a low number of samples, its predictive power remains at least "fair' in these cases. The main tree-based advantage of RF was thereafter used to assess the importance of the input variables in predicting the presence of the algal genera. Temperature had the most powerful effect on genera's presences, although this effect varies among genera. Finally, the genera were clustered based on their response to the considered abiotic variables and common trends in an ecological context were identified.

Keywords: Eastern Mediterranean; Random Forest; abiotic parameters; harmful algal; machine learning.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Harmful Algal Bloom*
  • Machine Learning*
  • Temperature