Agglomerative Clustering of Enteric Infections and Weather Parameters to Identify Seasonal Outbreaks in Cold Climates

Int J Environ Res Public Health. 2019 Jun 12;16(12):2083. doi: 10.3390/ijerph16122083.

Abstract

The utility of agglomerative clustering methods for understanding dynamic systems that do not have a well-defined periodic structure has not yet been explored. We propose using this approach to examine the association between disease and weather parameters, to compliment the traditional harmonic regression models, and to determine specific meteorological conditions favoring high disease incidence. We utilized daily records on reported salmonellosis and non-specific enteritis, and four meteorological parameters (ambient temperature, dew point, humidity, and barometric pressure) in Barnaul, Russia in 2004-2011, maintained by the CliWaDIn database. The data structure was examined using the t-distributed stochastic neighbor embedding (t-SNE) method. The optimal number of clusters was selected based on Ward distance using the silhouette metric. The selected clusters were assessed with respect to their density and homogeneity. We detected that a well-defined cluster with high counts of salmonellosis occurred during warm summer days and unseasonably warm days in spring. We also detected a cluster with high counts of non-specific enteritis that occurred during unusually "very warm" winter days. The main advantage offered by the proposed technique is its ability to create a composite of meteorological conditions-a rule of thumb-to detect days favoring infectious outbreaks for a given location. These findings have major implications for understanding potential health impacts of climate change.

Keywords: agglomerative clustering; climate change; harmonic regression models; machine learning; meteorological parameters; non-specific enteric infections; salmonellosis; seasonality; t-SNE method.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Atmospheric Pressure*
  • Cluster Analysis*
  • Cold Climate*
  • Disease Outbreaks / statistics & numerical data*
  • Gastrointestinal Diseases / epidemiology
  • Gastrointestinal Microbiome*
  • Humans
  • Humidity*
  • Russia / epidemiology
  • Seasons*