Deep learning time series prediction models in surveillance data of hepatitis incidence in China

PLoS One. 2022 Apr 13;17(4):e0265660. doi: 10.1371/journal.pone.0265660. eCollection 2022.

Abstract

Background: Precise incidence prediction of Hepatitis infectious disease is critical for early prevention and better government strategic planning. In this paper, we presented different prediction models using deep learning methods based on the monthly incidence of Hepatitis through a national public health surveillance system in China mainland.

Methods: We assessed and compared the performance of three deep learning methods, namely, Long Short-Term Memory (LSTM) prediction model, Recurrent Neural Network (RNN) prediction model, and Back Propagation Neural Network (BPNN) prediction model. The data collected from 2005 to 2018 were used for the training and prediction model, while the data are split via 5-Fold cross-validation. The performance was evaluated based on three metrics: mean square error (MSE), mean absolute error (MAE), and mean absolute percentage error (MAPE).

Results: Among the year 2005-2018, 20,924,951 cases and 11,892 deaths were supervised in the system. Hepatitis B (HB) is the most disease-causing incidence and death, and the proportion is greater than 70 percent, while the percentage of the incidence and deaths is decreased much in 2018 compared with 2005. Based on the measured errors and the visualization of the three neural networks, there is no one model predicting the incidence cases that can be completely superior to other models. When predicting the number of incidence cases for HB, the performance ranking of the three models from high to low is LSTM, BPNN, RNN, while it is LSTM, RNN, BPNN for Hepatitis C (HC). while the MAE, MSE and MAPE of the LSTM model for HB, HC are 3.84*10-06, 3.08*10-11, 4.981, 8.84*10-06, 1.98*10-12,5.8519, respectively.

Conclusions: The deep learning time series predictive models show their significance to forecast the Hepatitis incidence and have the potential to assist the decision-makers in making efficient decisions for the early detection of the disease incidents, which would significantly promote Hepatitis disease control and management.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • China / epidemiology
  • Deep Learning*
  • Forecasting
  • Hepatitis*
  • Humans
  • Incidence
  • Time Factors

Grants and funding

This research was supported by the National Natural Science Foundation of China (52005192) and the Fundamental Research Funds for the Central Universities (HUST: 5003100089). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.