Evaluation of whole-genome sequencing for outbreak detection of Verotoxigenic Escherichia coli O157:H7 from the Canadian perspective

BMC Genomics. 2018 Dec 4;19(1):870. doi: 10.1186/s12864-018-5243-3.

Abstract

Background: Rapid and accurate identification of Verotoxigenic Escherichia coli (VTEC) O157:H7 is dependent on well-established, standardized and highly discriminatory typing methods. Currently, conventional subtyping tests for foodborne bacterial pathogen surveillance are rapidly being replaced with whole-genome sequencing (WGS) in public health laboratories. The capacity of WGS to revolutionize global foodborne disease surveillance has positioned this tool to become the new gold standard; however, to ensure evidence standards for public health decision making can still be achieved, the performance of WGS must be thoroughly validated against current gold standard methods prior to implementation. Here we aim to verify the performance of WGS in comparison to pulsed-field gel electrophoresis (PFGE) and multiple-locus variable-number tandem repeat analysis (MLVA) for eight retrospective outbreaks of VTEC O157:H7 from the Canadian perspective. Since real-time implementation and routine use of WGS in public health laboratories is highly reliant on standardized data analysis tools, we also provide a comparative analysis of two popular methodologies for WGS analyses; an in-house developed single nucleotide variant phylogenomics (SNVPhyl) pipeline and the BioNumerics whole genome multilocus sequence typing (wgMLST) tool. To provide a useful and consistent starting point for examining laboratory-based surveillance data for VTEC O157:H7 in Canada, we also aim to describe the number of genetic differences observed among outbreak-associated isolates.

Results: WGS provided enhanced resolution over traditional subtyping methods, and accurately distinguished outbreak-related isolates from non-outbreak related isolates with high epidemiological concordance. WGS also illuminated potential linkages between sporadic cases of illness and contaminated food, and isolates spanning multiple years. The topologies generated by SNVPhyl and wgMLST were highly congruent with strong statistical support. Few genetic differences were observed among outbreak-related isolates (≤5 SNVs/ < 10 wgMLST alleles) unless the outbreak was suspected to be multi-strain.

Conclusions: This study validates the superiority of WGS and indicates the BioNumerics wgMLST schema is suitable for surveillance and cluster detection of VTEC O157:H7. These findings will provide a useful and consistent starting point for examining WGS data for prospective laboratory-based surveillance of VTEC O157:H7, but however, the data will continue to be interpreted according to context and in combination with epidemiological and food safety evidence to inform public-health decision making in Canada.

Keywords: Escherichia coli; O157:H7; Outbreak; SNVPhyl; VTEC; Whole genome sequencing; wgMLST.

Publication types

  • Evaluation Study

MeSH terms

  • Canada / epidemiology
  • Cluster Analysis
  • DNA, Bacterial / chemistry
  • DNA, Bacterial / isolation & purification
  • DNA, Bacterial / metabolism
  • Disease Outbreaks
  • Electrophoresis, Gel, Pulsed-Field
  • Escherichia coli Infections / epidemiology
  • Escherichia coli Infections / microbiology*
  • Escherichia coli Infections / pathology
  • Escherichia coli O157 / genetics*
  • Escherichia coli O157 / isolation & purification
  • Genetic Variation
  • Humans
  • Multilocus Sequence Typing
  • Polymorphism, Single Nucleotide
  • Retrospective Studies
  • Shiga-Toxigenic Escherichia coli / genetics*
  • Shiga-Toxigenic Escherichia coli / isolation & purification
  • Whole Genome Sequencing / methods*

Substances

  • DNA, Bacterial