Comparing and linking machine learning and semi-mechanistic models for the predictability of endemic measles dynamics

PLoS Comput Biol. 2022 Sep 8;18(9):e1010251. doi: 10.1371/journal.pcbi.1010251. eCollection 2022 Sep.

Abstract

Measles is one the best-documented and most-mechanistically-studied non-linear infectious disease dynamical systems. However, systematic investigation into the comparative performance of traditional mechanistic models and machine learning approaches in forecasting the transmission dynamics of this pathogen are still rare. Here, we compare one of the most widely used semi-mechanistic models for measles (TSIR) with a commonly used machine learning approach (LASSO), comparing performance and limits in predicting short to long term outbreak trajectories and seasonality for both regular and less regular measles outbreaks in England and Wales (E&W) and the United States. First, our results indicate that the proposed LASSO model can efficiently use data from multiple major cities and achieve similar short-to-medium term forecasting performance to semi-mechanistic models for E&W epidemics. Second, interestingly, the LASSO model also captures annual to biennial bifurcation of measles epidemics in E&W caused by susceptible response to the late 1940s baby boom. LASSO may also outperform TSIR for predicting less-regular dynamics such as those observed in major cities in US between 1932-45. Although both approaches capture short-term forecasts, accuracy suffers for both methods as we attempt longer-term predictions in highly irregular, post-vaccination outbreaks in E&W. Finally, we illustrate that the LASSO model can both qualitatively and quantitatively reconstruct mechanistic assumptions, notably susceptible dynamics, in the TSIR model. Our results characterize the limits of predictability of infectious disease dynamics for strongly immunizing pathogens with both mechanistic and machine learning models, and identify connections between these two approaches.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Communicable Diseases* / epidemiology
  • Disease Outbreaks
  • Epidemics*
  • Humans
  • Machine Learning
  • Measles* / epidemiology
  • United States / epidemiology

Grants and funding

This work is supported by the Dean's Pilot and Innovation Awards provided by the Dean’s office at Rollins School of Public Health at Emory University. B.G and C.J.M gratefully acknowledge financial support from the Schmidt DataX Fund at Princeton University made possible through a major gift from the Schmidt Futures Foundation. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.