Harmonized Genome Wide Typing of Tubercle Bacilli Using a Web-Based Gene-By-Gene Nomenclature System

EBioMedicine. 2018 Aug:34:131-138. doi: 10.1016/j.ebiom.2018.07.030. Epub 2018 Aug 13.

Abstract

Background: Global tuberculosis (TB) control is challenged by uncontrolled transmission of Mycobacterium tuberculosis complex (Mtbc) strains, especially of multidrug-resistant (MDR) or extensively drug-resistant (XDR) variants. Precise analysis of transmission networks is the basis for tracing outbreak M/XDR clones and improving TB control. However, classical genotyping tools lack discriminatory power because strains of particular successful lineages, e.g. Beijing or outbreak strains, are highly similar. This can be overcome by whole genome sequencing (WGS) approaches, but these are not yet standardized to facilitate larger investigations encompassing different laboratories or outbreak tracing across borders.

Methods: We established and improved a whole genome gene-by-gene multilocus sequence typing approach encompassing a stable set of core genome genes (cgMLST) and linked it to a web-based nomenclature server (cgMLST.org) that facilitates assignment and storage of allele numbers.
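
The gene-by-gene idea can be illustrated with a minimal sketch: each distinct sequence observed at a locus receives a stable integer allele number, and a strain's type is the list of its allele numbers. The class `AlleleNomenclature`, the function `type_strain`, and the short sequences below are hypothetical and serve only as an in-memory stand-in; they do not reproduce the cgMLST.org interface or its data.

```python
import hashlib
from collections import defaultdict


class AlleleNomenclature:
    """Toy in-memory stand-in for a nomenclature server (hypothetical)."""

    def __init__(self):
        # locus -> {sequence digest -> allele number}
        self._alleles = defaultdict(dict)

    def assign(self, locus, sequence):
        """Return the allele number for a sequence, creating one if unseen."""
        digest = hashlib.sha256(sequence.upper().encode("ascii")).hexdigest()
        known = self._alleles[locus]
        if digest not in known:
            # New alleles are numbered consecutively per locus, starting at 1.
            known[digest] = len(known) + 1
        return known[digest]


def type_strain(nomenclature, gene_sequences):
    """Convert {locus: sequence} into an allele profile {locus: number}."""
    return {
        locus: nomenclature.assign(locus, seq)
        for locus, seq in gene_sequences.items()
    }


server = AlleleNomenclature()
# Placeholder sequences, far shorter than real genes.
profile_a = type_strain(server, {"Rv0001": "ATGACC", "Rv0002": "ATGGGT"})
profile_b = type_strain(server, {"Rv0001": "ATGACC", "Rv0002": "ATGGGA"})
print(profile_a, profile_b)
```

Because numbers are assigned centrally and never reused, two laboratories that submit the same sequence obtain the same allele number, which is what makes profiles comparable across sites.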

Findings: We evaluated and refined a previously suggested cgMLST schema by using a reference strain set (n = 251) reflecting the global diversity of the Mtbc. A set of 2891 genes showed excellent performance: at least 97% of the genes were reliably identified in strains of all Mtbc lineages, and the set discriminated outbreak strains. cgMLST allele numbers were automatically retrieved from and stored at cgMLST.org.
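
As a rough illustration of how such allele profiles might be evaluated, the sketch below computes the fraction of scheme genes called in a strain and the pairwise allele distance between two strains. The function names, the treatment of missing genes (ignored pairwise), the use of 0.97 as a cut-off, and the four-gene toy scheme are assumptions made for the example; the abstract does not specify these details.

```python
def fraction_called(profile, scheme_loci):
    """Share of scheme loci for which an allele number was obtained."""
    called = sum(1 for locus in scheme_loci if profile.get(locus) is not None)
    return called / len(scheme_loci)


def passes_qc(profile, scheme_loci, min_fraction=0.97):
    """Assumed quality rule: at least min_fraction of loci must be called."""
    return fraction_called(profile, scheme_loci) >= min_fraction


def allele_distance(profile_p, profile_q):
    """Count loci with different alleles, ignoring loci missing in either."""
    distance = 0
    for locus, allele_p in profile_p.items():
        allele_q = profile_q.get(locus)
        if allele_p is None or allele_q is None:
            continue  # pairwise ignore of missing values (an assumption)
        if allele_p != allele_q:
            distance += 1
    return distance


# Toy scheme with four loci; a real scheme would have thousands.
scheme = ["g1", "g2", "g3", "g4"]
strain_a = {"g1": 1, "g2": 1, "g3": 2, "g4": None}
strain_b = {"g1": 1, "g2": 3, "g3": 2, "g4": 5}

print(fraction_called(strain_a, scheme))
print(allele_distance(strain_a, strain_b))
```

Strains separated by few allele differences are candidates for belonging to the same transmission chain; the distance threshold used for that call is study-specific and is not given here.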

Interpretation: The refined cgMLST schema provides high resolution genome-based typing of clinical strains of all Mtbc lineages. Combined with a web-based nomenclature server, it facilitates rapid, high-resolution, and harmonized tracing of clinical Mtbc strains needed for prospective local and global surveillance.

Keywords: Core genome MLST; Genotyping; Molecular epidemiology; Mycobacterium tuberculosis; Whole genome sequencing.

MeSH terms

  • Bacterial Typing Techniques*
  • Drug Resistance, Bacterial
  • Drug Resistance, Multiple
  • Genome, Bacterial*
  • Genotype
  • Internet
  • Multilocus Sequence Typing*
  • Mycobacterium tuberculosis / classification*
  • Mycobacterium tuberculosis / genetics*
  • Polymorphism, Single Nucleotide
  • Whole Genome Sequencing