Predicting sea levels using ML algorithms in selected locations along coastal Malaysia

Heliyon. 2023 Aug 23;9(9):e19426. doi: 10.1016/j.heliyon.2023.e19426. eCollection 2023 Sep.

Abstract

In consideration of the distinct behavior of machine learning (ML) algorithms, six well-defined ML used were carried out in this study for predicting sea level on a day-to-day basis. Data compiled from 1985 to 2018 was utilized for training and testing the developed models. An assessment of the multiple statistics-driven regression algorithms resulted such that each tested location was associated with a particular preferred model. The following were the developed best models for their respective study areas: In Peninsular Malaysia, the interactions linear regression model was the best at Pulau Langkawi (RMSE = 19.066), the Matern 5/2 gaussian process regression model at Geting (RMSE = 49.891), and the trilayered artificial neural network at Pulau Pinang (RMSE = 20.026), while the linear regression model was the best at Sandakan in Sabah, East Malaysia (RMSE = 14.054). Other metrics, such as MAE and R-square, were also at their best values, each providing its best values, further substantiating the RMSE respectively, at each of the study areas. These empirical statistics (or metrics) also revealed that despite employing sea level as the sole parameter, results obtained were exceptional better when utilizing a 7-day lag, regardless of the model used. Notably, lag variables with less than a 7-day lag could degrade the model's accuracy in representing ground reality. The study emphasizes the importance of thorough training and testing of ML to aid decision-makers in developing mitigation actions for the climate change phenomena of sea level rise through reliable ML.

Keywords: Coastal regions; Machine learning; Sea level rise prediction.