Statistical identifiability and sample size calculations for serial seroepidemiology

Epidemics. 2015 Sep:12:30-9. doi: 10.1016/j.epidem.2015.02.005. Epub 2015 Mar 3.

Abstract

Inference on disease dynamics is typically performed using case reporting time series of symptomatic disease. The inferred dynamics will vary depending on the reporting patterns and surveillance system for the disease in question, and the inference will miss mild or underreported epidemics. To eliminate the variation introduced by differing reporting patterns and to capture asymptomatic or subclinical infection, inferential methods can be applied to serological data sets instead of case reporting data. To reconstruct complete disease dynamics, one would need to collect a serological time series. In the statistical analysis presented here, we consider a particular kind of serological time series with repeated, periodic collections of population-representative serum. We refer to this study design as a serial seroepidemiology (SSE) design, and we base the analysis on our epidemiological knowledge of influenza. We consider a study duration of three to four years, during which a single antigenic type of influenza would be circulating, and we evaluate our ability to reconstruct disease dynamics based on serological data alone. We show that the processes of reinfection, antibody generation, and antibody waning confound each other and are not always statistically identifiable, especially when dynamics resemble a non-oscillating endemic equilibrium behavior. We introduce some constraints to partially resolve this confounding, and we show that transmission rates and basic reproduction numbers can be accurately estimated in SSE study designs. Seasonal forcing is more difficult to identify as serology-based studies only detect oscillations in antibody titers of recovered individuals, and these oscillations are typically weaker than those observed for infected individuals. To accurately estimate the magnitude and timing of seasonal forcing, serum samples should be collected every two months and 200 or more samples should be included in each collection; this sample size estimate is sensitive to the antibody waning rate and the assumed level of seasonal forcing.

Keywords: Antibody waning; Complete disease dynamics; Influenza; Maximum likelihood; Serial seroepidemiology; Seroepidemiology; Statistical identifiability.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Antibodies / blood
  • Disease Transmission, Infectious
  • Epidemiologic Methods*
  • Humans
  • Likelihood Functions
  • Models, Theoretical
  • Sample Size*
  • Seasons
  • Seroepidemiologic Studies*
  • Statistics as Topic
  • Time Factors

Substances

  • Antibodies