The Prediction of Influenza-like Illness and Respiratory Disease Using LSTM and ARIMA

Int J Environ Res Public Health. 2022 Feb 7;19(3):1858. doi: 10.3390/ijerph19031858.

Abstract

This paper proposed the forecasting model of Influenza-like Illness (ILI) and respiratory disease. The dataset was extracted from the Taiwan Environmental Protection Administration (EPA) for air pollutants data and the Centers for Disease Control (CDC) for disease cases from 2009 to 2018. First, this paper applied the ARIMA method, which trained based on the weekly number of disease cases in time series. Second, we implemented the Long short-term memory (LSTM) method, which trained based on the correlation between the weekly number of diseases and air pollutants. The models were also trained and evaluated based on five and ten years of historical data. Autoregressive integrated moving average (ARIMA) has an excellent model in the five-year dataset of ILI at 2564.9 compared to ten years at 8173.6 of RMSE value. This accuracy is similar to the Respiratory dataset, which gets 15,656.7 in the five-year dataset and 22,680.4 of RMSE value in the ten-year dataset. On the contrary, LSTM has better accuracy in the ten-year dataset than the five-year dataset. For example, on average of RMSE in the ILI dataset, LSTM has 720.2 RMSE value in five years and 517.0 in ten years dataset. Also, in the Respiratory disease dataset, LSTM gets 4768.6 of five years of data and 3254.3 of the ten-year dataset. These experiments revealed that the LSTM model generally outperforms ARIMA by three to seven times higher model performance.

Keywords: ARIMA; ILI; LSTM; PM2.5; air pollution; influenza-like illness; respiratory disease.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Air Pollutants*
  • Forecasting
  • Humans
  • Influenza, Human* / epidemiology
  • Respiration Disorders*
  • Taiwan / epidemiology

Substances

  • Air Pollutants