Learning inter-annual flood loss risk models from historical flood insurance claims

J Environ Manage. 2023 Dec 1:347:118862. doi: 10.1016/j.jenvman.2023.118862. Epub 2023 Oct 6.

Abstract

Flooding is a natural hazard that causes substantial loss of lives and livelihoods worldwide. Developing predictive models for flood-induced financial losses is crucial for applications such as insurance underwriting. This research uses the National Flood Insurance Program (NFIP) dataset between 2000 and 2020 to evaluate the predictive skill of past data in predicting near-future flood loss risk. Our approach applies neural networks (Conditional Generative Adversarial Networks), decision trees (Extreme Gradient Boosting), and kernel-based regressors (Gaussian Processes) to estimate pointwise losses. It aggregates them over intervals using a bias-corrected Burr-Pareto distribution to predict risk. The regression models help identify the most informative predictors and highlight crucial factors influencing flood-related financial losses. Applying our approach to quantify the county-level coastal flood loss risk in eight US Southern states results in an R2=0.807, substantially outperforming related work using stage-damage curves. More detailed testing on 11 counties with significant claims in the NFIP dataset reveals that Extreme Gradient Boosting yields the most favorable results, and bias correction significantly improves the similarity between the predicted and reference claim amount distributions. Our experiments also show that, despite the already experienced climate change, the difference in future short-term risk predictions of flood-loss amounts between historical shifting or expanding training data windows is insignificant.

Keywords: Bias correction; Climate change; Extreme Gradient Boosting; Feature selection; Flood loss; Gaussian Processes; Generative Adversarial Networks; NFIP dataset; Natural hazards.

MeSH terms

  • Climate Change
  • Floods*
  • Forecasting
  • Insurance*