Phylogenetic Analysis of Mycobacterium tuberculosis Strains in Wales by Use of Core Genome Multilocus Sequence Typing To Analyze Whole-Genome Sequencing Data

J Clin Microbiol. 2019 May 24;57(6):e02025-18. doi: 10.1128/JCM.02025-18. Print 2019 Jun.

Abstract

An inability to standardize the bioinformatic data produced by whole-genome sequencing (WGS) has been a barrier to its widespread use in tuberculosis phylogenetics. The phylogenetics of tuberculosis in Wales, United Kingdom, have not previously been studied. The aim of this study was to carry out such an analysis using Ridom SeqSphere software for core genome multilocus sequence typing (cgMLST) analysis of WGS data. Sixty-six Mycobacterium tuberculosis isolates (including 42 outbreak-associated isolates) from south Wales were sequenced using an Illumina platform. Isolates were assigned to principal genetic groups, single nucleotide polymorphism (SNP) cluster groups, lineages, and sublineages using SNP-calling protocols. WGS data were submitted to the Ridom SeqSphere software for cgMLST analysis and analyzed alongside 179 previously lineage-defined isolates. The data set was dominated by the Euro-American lineage, and its sublineage composition consisted mainly of T, X, and Haarlem family strains. The cgMLST analysis successfully assigned 58 isolates to major lineages, and the results were consistent with those obtained by traditional SNP mapping methods. In addition, the cgMLST scheme was used to resolve an outbreak of tuberculosis occurring in the region. This study supports the use of a cgMLST method for standardized phylogenetic assignment of tuberculosis isolates and for outbreak resolution and provides the first insight into Welsh tuberculosis phylogenetics, identifying the presence of the Haarlem sublineage commonly associated with virulent traits.
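The general idea behind cgMLST-based comparison can be illustrated with a minimal Python sketch. It is not the Ridom SeqSphere implementation and does not reproduce the study's data: the allele profiles, the locus names, the convention of ignoring loci with a missing call, and the clustering threshold of 1 are all assumptions made for illustration. Each isolate is represented as a mapping from core genome locus to allele number, the distance between two isolates is the number of shared loci with different alleles, and isolates are grouped by single linkage.

```python
from itertools import combinations


def allelic_distance(profile_a, profile_b):
    """Count loci called in both profiles whose allele numbers differ."""
    distance = 0
    for locus, allele_a in profile_a.items():
        allele_b = profile_b.get(locus)
        # Skip loci with a missing call in either isolate (assumed convention).
        if allele_a is None or allele_b is None:
            continue
        if allele_a != allele_b:
            distance += 1
    return distance


def single_linkage_clusters(profiles, threshold):
    """Group isolates linked by a chain of pairwise distances <= threshold."""
    parent = {name: name for name in profiles}

    def find(name):
        # Follow parent pointers up to the representative of the group.
        while parent[name] != name:
            name = parent[name]
        return name

    for a, b in combinations(profiles, 2):
        if allelic_distance(profiles[a], profiles[b]) <= threshold:
            parent[find(a)] = find(b)

    clusters = {}
    for name in profiles:
        clusters.setdefault(find(name), set()).add(name)
    return sorted(clusters.values(), key=lambda c: sorted(c))


# Hypothetical allele profiles, invented for illustration only.
profiles = {
    "isolate_A": {"locus_1": 1, "locus_2": 4, "locus_3": 2, "locus_4": 7},
    "isolate_B": {"locus_1": 1, "locus_2": 4, "locus_3": 2, "locus_4": None},
    "isolate_C": {"locus_1": 3, "locus_2": 5, "locus_3": 2, "locus_4": 7},
}

# The threshold is an arbitrary value chosen for this toy example.
clusters = single_linkage_clusters(profiles, threshold=1)
print(clusters)
```

With these made-up profiles, isolate_A and isolate_B differ at no shared locus and fall into one group, while isolate_C differs from each of them at two loci and stays separate. Because isolates are compared by allele numbers in a fixed scheme rather than by pipeline-specific SNP calls, results of this kind are comparable between laboratories using the same scheme.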

Keywords: Mycobacterium tuberculosis; outbreak; phylogenetics; tuberculosis; whole-genome sequencing.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Disease Outbreaks
  • Genome, Bacterial*
  • Genotype
  • Humans
  • Molecular Epidemiology
  • Multilocus Sequence Typing*
  • Mycobacterium tuberculosis / classification*
  • Mycobacterium tuberculosis / genetics*
  • Mycobacterium tuberculosis / isolation & purification
  • Phylogeny
  • Polymorphism, Single Nucleotide
  • Tuberculosis / epidemiology*
  • Tuberculosis / microbiology*
  • Wales / epidemiology
  • Whole Genome Sequencing*