Inferring social network structure from bacterial sequence data

PLoS One. 2011;6(8):e22685. doi: 10.1371/journal.pone.0022685. Epub 2011 Aug 1.

Abstract

Using DNA sequence data from pathogens to infer transmission networks has traditionally been done in the context of epidemics and outbreaks. Sequence data could analogously be applied to cases of ubiquitous commensal bacteria; however, instead of inferring chains of transmission to track the spread of a pathogen, sequence data for bacteria circulating in an endemic equilibrium could be used to infer information about host contact networks. Here, we show--using simulated data--that multilocus DNA sequence data, based on multilocus sequence typing schemes (MLST), from isolates of commensal bacteria can be used to infer both local and global properties of the contact networks of the populations being sampled. Specifically, for MLST data simulated from small-world networks, the small world parameter controlling the degree of structure in the contact network can robustly be estimated. Moreover, we show that pairwise distances in the network--degrees of separation--correlate with genetic distances between isolates, so that how far apart two individuals in the network are can be inferred from MLST analysis of their commensal bacteria. This result has important consequences, and we show an example from epidemiology: how this result could be used to test for infectious origins of diseases of unknown etiology.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Bacteria / genetics*
  • DNA, Bacterial / genetics*
  • Likelihood Functions
  • Social Support*

Substances

  • DNA, Bacterial