Robustness assessment of whole bacterial genome segmentations

J Comput Biol. 2011 Sep;18(9):1155-65. doi: 10.1089/cmb.2011.0115.

Abstract

Comparison of closely related bacterial genomes has revealed the presence of highly conserved sequences forming a "backbone" that is interrupted by numerous, less conserved, DNA fragments. Segmentation of bacterial genomes into backbone and variable regions is particularly useful to investigate, among other things, bacterial genome evolution. Several software tools have been designed to compare complete bacterial chromosomes and a few online databases store pre-computed genome comparisons. However, very few statistical methods are available to evaluate the reliability of these software tools and to compare the results obtained with them. To fill this gap, we have developed two local scores to measure the robustness of bacterial genome segmentations. Our method uses a simulation procedure based on random perturbations of the compared genomes. The two scores described in this article provide useful information and are easy to implement, and their interpretation is intuitive. We show that they are suited to discriminate between robust and non-robust segmentations when genome aligners such as MAUVE and MGA are used.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Calibration
  • Conserved Sequence*
  • Escherichia coli / genetics
  • Genetic Variation
  • Genome, Bacterial*
  • Models, Genetic
  • Pseudomonas syringae / genetics
  • Sequence Alignment / methods*
  • Sequence Analysis, DNA / methods*
  • Streptococcus pyogenes / genetics