Dengue prediction by the web: Tweets are a useful tool for estimating and forecasting Dengue at country and city level

PLoS Negl Trop Dis. 2017 Jul 18;11(7):e0005729. doi: 10.1371/journal.pntd.0005729. eCollection 2017 Jul.

Abstract

Background: Infectious diseases are a leading threat to public health. Accurate and timely monitoring of disease risk and progress can reduce their impact. Mentioning a disease in social networks is correlated with physician visits by patients, and can be used to estimate disease activity. Dengue is the fastest growing mosquito-borne viral disease, with an estimated annual incidence of 390 million infections, of which 96 million manifest clinically. Dengue burden is likely to increase in the future owing to trends toward increased urbanization, scarce water supplies and, possibly, environmental change. The epidemiological dynamic of Dengue is complex and difficult to predict, partly due to costly and slow surveillance systems.

Methodology / principal findings: In this study, we aimed to quantitatively assess the usefulness of data acquired by Twitter for the early detection and monitoring of Dengue epidemics, both at country and city level at a weekly basis. Here, we evaluated and demonstrated the potential of tweets modeling for Dengue estimation and forecast, in comparison with other available web-based data, Google Trends and Wikipedia access logs. Also, we studied the factors that might influence the goodness-of-fit of the model. We built a simple model based on tweets that was able to 'nowcast', i.e. estimate disease numbers in the same week, but also 'forecast' disease in future weeks. At the country level, tweets are strongly associated with Dengue cases, and can estimate present and future Dengue cases until 8 weeks in advance. At city level, tweets are also useful for estimating Dengue activity. Our model can be applied successfully to small and less developed cities, suggesting a robust construction, even though it may be influenced by the incidence of the disease, the activity of Twitter locally, and social factors, including human development index and internet access.

Conclusions: Tweets association with Dengue cases is valuable to assist traditional Dengue surveillance at real-time and low-cost. Tweets are able to successfully nowcast, i.e. estimate Dengue in the present week, but also forecast, i.e. predict Dengue at until 8 weeks in the future, both at country and city level with high estimation capacity.

Publication types

  • Comparative Study
  • Evaluation Study

MeSH terms

  • Dengue / epidemiology*
  • Dengue / transmission
  • Epidemiologic Methods*
  • Forecasting
  • Humans
  • Internet*
  • Models, Statistical
  • Social Media*

Grants and funding

This study was supported by the Instituto Nacional de Ciencia e Tecnologia (INCT) em Dengue (CNPq/FAPEMIG 573876/2008-08; http://labs.icb.ufmg.br/inctemdengue) and the Pan American Health Organization (PAHO) (BR/LOA/1200073.001; http://www.paho.org/bra). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.