Statistical methods for predicting tuberculosis incidence based on data from Guangxi, China

BMC Infect Dis. 2020 Apr 22;20(1):300. doi: 10.1186/s12879-020-05033-3.

Abstract

Background: Tuberculosis (TB) remains a serious public health problem with substantial financial burden in China. The incidence of TB in Guangxi province is much higher than that in the national level, however, there is no predictive study of TB in recent years in Guangxi, therefore, it is urgent to construct a model to predict the incidence of TB, which could provide help for the prevention and control of TB.

Methods: Box-Jenkins model methods have been successfully applied to predict the incidence of infectious disease. In this study, based on the analysis of TB incidence in Guangxi from January 2012 to June 2019, we constructed TB prediction model by Box-Jenkins methods, and used root mean square error (RMSE), mean absolute error (MAE) and mean absolute percentage error (MAPE) to test the performance and prediction accuracy of model.

Results: From January 2012 to June 2019, a total of 587,344 cases of TB were reported and 879 cases died in Guangxi. Based on TB incidence from January 2012 to December 2018, the SARIMA((2),0,(2))(0,1,0)12 model was established, the AIC and SC of this model were 2.87 and 2.98, the fitting accuracy indexes, such as RMSE, MAE and MAPE were 0.98, 0.77 and 5.8 respectively; the prediction accuracy indexes, such as RMSE, MAE and MAPE were 0.62, 0.45 and 3.77, respectively. Based on the SARIMA((2),0,(2))(0,1,0)12 model, we predicted the TB incidence in Guangxi from July 2019 to December 2020.

Conclusions: This study filled the gap in the prediction of TB incidence in Guangxi in recent years. The established SARIMA((2),0,(2))(0,1,0)12 model has high prediction accuracy and good prediction performance. The results suggested the change trend of TB incidence predicted by SARIMA((2),0,(2))(0,1,0)12 model from July 2019 to December 2020 was similar to that in the previous two years, and TB incidence will experience slight decrease, the predicted results can provide scientific reference for the prevention and control of TB in Guangxi, China.

Keywords: Prediction; SARIMA model; Tuberculosis.

MeSH terms

  • China / epidemiology
  • Datasets as Topic
  • Forecasting
  • Humans
  • Incidence
  • Models, Statistical*
  • Public Health / statistics & numerical data
  • Seasons
  • Software
  • Tuberculosis / epidemiology*
  • Tuberculosis / mortality