Modeling and forecasting the COVID-19 pandemic time-series data

Soc Sci Q. 2021 Sep;102(5):2070-2087. doi: 10.1111/ssqu.13008. Epub 2021 Aug 7.

Abstract

Objective: We analyze the number of recorded cases and deaths of COVID-19 in many parts of the world, with the aim to understand the complexities of the data, and produce regular forecasts.

Methods: The SARS-CoV-2 virus that causes COVID-19 has affected societies in all corners of the globe but with vastly differing experiences across countries. Health-care and economic systems vary significantly across countries, as do policy responses, including testing, intermittent lockdowns, quarantine, contact tracing, mask wearing, and social distancing. Despite these challenges, the reported data can be used in many ways to help inform policy. We describe how to decompose the reported time series of confirmed cases and deaths into a trend, seasonal, and irregular component using machine learning methods.

Results: This decomposition enables statistical computation of measures of the mortality ratio and reproduction number for any country, and we conduct a counterfactual exercise assuming that the United States had a summer outcome in 2020 similar to that of the European Union. The decomposition is also used to produce forecasts of cases and deaths, and we undertake a forecast comparison which highlights the importance of seasonality in the data and the difficulties of forecasting too far into the future.

Conclusion: Our adaptive data-based methods and purely statistical forecasts provide a useful complement to the output from epidemiological models.

Keywords: Covid‐19; epidemiology; nonstationarity; reproduction number; time‐series forecasting.