Inferring global-scale temporal latent topics from news reports to predict public health interventions for COVID-19

Patterns (N Y). 2022 Mar 11;3(3):100435. doi: 10.1016/j.patter.2022.100435. Epub 2022 Feb 1.

Abstract

The COVID-19 pandemic has highlighted the importance of non-pharmacological interventions (NPIs) for controlling epidemics of emerging infectious diseases. Despite their importance, NPIs have been monitored mainly through the manual efforts of volunteers. This approach hinders measurement of the NPI effectiveness and development of evidence to guide their use to control the global pandemic. We present EpiTopics, a machine learning approach to support automation of NPI prediction and monitoring at both the document level and country level by mining the vast amount of unlabeled news reports on COVID-19. EpiTopics uses a 3-stage, transfer-learning algorithm to classify documents according to NPI categories, relying on topic modeling to support result interpretation. We identified 25 interpretable topics under 4 distinct and coherent COVID-related themes. Importantly, the use of these topics resulted in significant improvements over alternative automated methods in predicting the NPIs in labeled documents and in predicting country-level NPIs for 42 countries.

Keywords: COVID-19; latent topic models; non-pharmacological interventions; public health surveillance; transfer learning; variational autoencoder.

Publication types

  • News