Short-term forecasting of COVID-19 using support vector regression: An application using Zimbabwean data

Am J Infect Control. 2023 Oct;51(10):1095-1107. doi: 10.1016/j.ajic.2023.03.010. Epub 2023 Mar 30.

Abstract

Background: This study aims to show that including pairwise hierarchical interactions of covariates and combining forecasts from individual models improves prediction accuracy.

Methods: The least absolute shrinkage and selection operator via hierarchical pairwise interaction is used in selecting variables that are not correlated and with the greatest predictive power in single forecast models (Gradient boosting method [GBM], Generalized additive models [GAMs], Support vector regression [SVR]) are used in the analysis. The best model was selected based on the mean absolute error (MAE), the best key performance indicator for skewed data. Forecasts from the 5 models were combined using linear quantile regression averaging (LQRA). Box and Whiskers plots are used to diagnose the overall performance of fitted models.

Results: Single forecast models (GBM, GAMs, and SVRs) show that including pairwise interactions improves forecast accuracy. The SVR model with interactions based on the radial basis kernel function is the best from single forecast models with the lowest MAE. Combining point forecasts from all the single forecast models using the LQRA approach further reduces the MAE. However, based on the Box and Whiskers plot, the SVR model with pairwise interactions has the smallest range.

Conclusions: Based on the key performance indicators, combining predictions from several individual models improves forecast accuracy. However, overall, the SVM with pairwise hierarchical interactions outperforms all the other models.

Keywords: Generalized additive model; Hierarchical interactions; LASSO; Least absolute shrinkage; Linear quantile regression averaging; Support vector regression.

MeSH terms

  • Algorithms
  • COVID-19*
  • Humans
  • Linear Models
  • Zimbabwe / epidemiology