Support vector machine applied to predict the zoonotic potential of E. coli O157 cattle isolates

Proc Natl Acad Sci U S A. 2016 Oct 4;113(40):11312-11317. doi: 10.1073/pnas.1606567113. Epub 2016 Sep 19.

Abstract

Sequence analyses of pathogen genomes facilitate the tracking of disease outbreaks and allow relationships between strains to be reconstructed and virulence factors to be identified. However, these methods are generally used after an outbreak has happened. Here, we show that support vector machine analysis of bovine E. coli O157 isolate sequences can be applied to predict their zoonotic potential, identifying cattle strains more likely to be a serious threat to human health. Notably, only a minor subset (less than 10%) of bovine E. coli O157 isolates analyzed in our datasets were predicted to have the potential to cause human disease; this is despite the fact that the majority are within previously defined pathogenic lineages I or I/II and encode key virulence factors. The predictive capacity was retained when tested across datasets. The major differences between human and bovine E. coli O157 isolates were due to the relative abundances of hundreds of predicted prophage proteins. This finding has profound implications for public health management of disease because interventions in cattle, such a vaccination, can be targeted at herds carrying strains of high zoonotic potential. Machine-learning approaches should be applied broadly to further our understanding of pathogen biology.

Keywords: E. coli; Shiga toxin; cattle; machine learning; zoonosis.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Cattle
  • Cattle Diseases / epidemiology
  • Cattle Diseases / microbiology*
  • Disease Outbreaks
  • Escherichia coli Infections / epidemiology
  • Escherichia coli Infections / microbiology*
  • Escherichia coli O157 / isolation & purification*
  • Humans
  • Phylogeny
  • Shiga Toxin 2 / metabolism
  • Support Vector Machine*
  • United Kingdom / epidemiology
  • Zoonoses / epidemiology
  • Zoonoses / microbiology*

Substances

  • Shiga Toxin 2