Accurate estimation of influenza epidemics using Google search data via ARGO

Proc Natl Acad Sci U S A. 2015 Nov 24;112(47):14473-8. doi: 10.1073/pnas.1515373112. Epub 2015 Nov 9.

Abstract

Accurate real-time tracking of influenza outbreaks helps public health officials make timely and meaningful decisions that could save lives. We propose an influenza tracking model, ARGO (AutoRegression with GOogle search data), that uses publicly available online search data. In addition to having a rigorous statistical foundation, ARGO outperforms all previously available Google-search-based tracking models, including the latest version of Google Flu Trends, even though it uses only low-quality search data as input from publicly available Google Trends and Google Correlate websites. ARGO not only incorporates the seasonality in influenza epidemics but also captures changes in people's online search behavior over time. ARGO is also flexible, self-correcting, robust, and scalable, making it a potentially powerful tool that can be used for real-time tracking of other social events at multiple temporal and spatial resolutions.

Keywords: autoregressive exogenous model; big data; digital disease detection; influenza-like illnesses activity real-time estimation; seasonal influenza.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Epidemics*
  • Humans
  • Influenza, Human / epidemiology*
  • Internet
  • Retrospective Studies
  • Search Engine