SpolSimilaritySearch - A web tool to compare and search similarities between spoligotypes of Mycobacterium tuberculosis complex

Tuberculosis (Edinb). 2017 Jul:105:49-52. doi: 10.1016/j.tube.2017.04.007. Epub 2017 Apr 20.

Abstract

Spoligotyping is one of the most commonly used polymerase chain reaction (PCR)-based methods for identification and study of genetic diversity of Mycobacterium tuberculosis complex (MTBC). Despite its known limitations if used alone, the methodology is particularly useful when used in combination with other methods such as mycobacterial interspersed repetitive units - variable number of tandem DNA repeats (MIRU-VNTRs). At a worldwide scale, spoligotyping has allowed identification of information on 103,856 MTBC isolates (corresponding to 98049 clustered strains plus 5807 unique isolates from 169 countries of patient origin) contained within the SITVIT2 proprietary database of the Institut Pasteur de la Guadeloupe. The SpolSimilaritySearch web-tool described herein (available at: http://www.pasteur-guadeloupe.fr:8081/SpolSimilaritySearch) incorporates a similarity search algorithm allowing users to get a complete overview of similar spoligotype patterns (with information on presence or absence of 43 spacers) in the aforementioned worldwide database. This tool allows one to analyze spread and evolutionary patterns of MTBC by comparing similar spoligotype patterns, to distinguish between widespread, specific and/or confined patterns, as well as to pinpoint patterns with large deleted blocks, which play an intriguing role in the genetic epidemiology of M. tuberculosis. Finally, the SpolSimilaritySearch tool also provides with the country distribution patterns for each queried spoligotype.

Keywords: Database; Evolution; Molecular epidemiology; Mycobacterium tuberculosis; Similarity search; Spoligotyping.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Computational Biology / methods*
  • DNA, Bacterial / genetics*
  • Data Mining / methods*
  • Databases, Genetic*
  • Genetic Variation
  • Genotype
  • Humans
  • Interspersed Repetitive Sequences
  • Minisatellite Repeats
  • Molecular Epidemiology / methods*
  • Mycobacterium tuberculosis / genetics*
  • Pattern Recognition, Automated
  • Polymerase Chain Reaction / methods*
  • Tuberculosis / diagnosis
  • Tuberculosis / epidemiology
  • Tuberculosis / microbiology*

Substances

  • DNA, Bacterial