A new approach based on biology-inspired metaheuristic algorithms in combination with random forest to enhance the flood susceptibility mapping

J Environ Manage. 2023 Nov 1:345:118790. doi: 10.1016/j.jenvman.2023.118790. Epub 2023 Aug 28.

Abstract

Flash floods are one of the worst natural disasters, causing massive economic losses and many deaths. Creating a flood susceptibility map (FSM) that pinpoints the areas most at risk of flooding is a crucial non-structural solution for managing floods. This study aimed to assess the efficacy of combinations of the random forest (RF) model with three biology-inspired metaheuristic algorithms, namely invasive weed optimization (IWO), slime mould algorithm (SMA), and satin bowerbird optimization (SBO), for flood susceptibility mapping in Estahban town, Iran. Initially, synthetic-aperture radar (SAR) (Sentinel-1) and optical (Landsat-8) satellite images were integrated to monitor the flooded areas during the July 2022 monsoon in the study area. A dataset of 509 flood occurrence points was created to identify flood-prone areas using remote sensing techniques, considering the monitored flood areas. The dataset also included twelve flood-related criteria: topography, land cover, and climate. The holdout method was employed for modeling, with a ratio of 70:30 used for the train/test split. Data pre-processing techniques were conducted to improve model performance, including determining criteria importance and addressing multicollinearity issues using certainty factor (CF), multicollinearity, and information gain ratio (IGR) methods. Then FSM was prepared using RF, RF-IWO, RF-SBO, and RF-SMA models. The findings of this research revealed that the RF-IWO model was the best predictive model of flood susceptibility modeling, with root-mean-square-error (RMSE) (0.211 and 0.0.27), mean-absolute-error (MAE) (0.103 and 0.15), and coefficient-of-determination (R2) (0.821 and 0.707) in the training and testing phases, respectively. Receiver operating characteristic (ROC) curve analysis of FSM revealed that the most accurate models were the RF-IWO (area under the curve (AUC) = 0.983), RF-SBO (AUC = 0.979), RF-SMA (AUC = 0.963), and RF (AUC = 0.959), respectively. Integrating biology-inspired computing algorithms with machine learning algorithms presents a novel approach to enhancing the accuracy of FSMs.

Keywords: Biology-inspired algorithms; Ensemble algorithm; Flash flood; Satellite imagery; Spatial prediction.

MeSH terms

  • Algorithms
  • Climate
  • Cyclonic Storms*
  • Floods
  • Plant Weeds
  • Random Forest*