Cassette-like variation of restriction enzyme genes in Escherichia coli C and relatives

Nucleic Acids Res. 2004 Jan 26;32(2):522-34. doi: 10.1093/nar/gkh194. Print 2004.

Abstract

A surprising result of comparative bacterial genomics has been the large amount of DNA found to be present in one strain but not in another of the same species. We examine in detail one location where gene content varies extensively, the restriction cluster in Escherichia coli. This region is designated the Immigration Control Region (ICR) for the density and variability of restriction functions found there. To better define the boundaries of this variable locus, we determined the sequence of the region from a restrictionless strain, E.coli C. Here we compare the 13.7 kb E.coli C sequence spanning the site of the ICR with corresponding sequences from five E.coli strains and Salmonella typhimurium LT2. To discuss this variation, we adopt the term 'framework' to refer to genes that are stable components of genomes within related lineages, while 'migratory' genes are transient inhabitants of the genome. Strikingly, seven different migratory DNA segments, encoding different sets of genes and gene fragments, alternatively occupy a single well-defined location in the seven strains examined. The flanking framework genes, yjiS and yjiA, display approximately normal patterns of conservation. The patterns observed are consistent with the action of a site-specific recombinase. Since no nearby gene codes for a likely recombinase of known families, such a recombinase must be of a new family or unlinked.

MeSH terms

  • Base Sequence
  • Contig Mapping
  • DNA Restriction Enzymes / genetics*
  • DNA, Bacterial / genetics
  • Escherichia coli / classification*
  • Escherichia coli / enzymology
  • Escherichia coli / genetics*
  • Escherichia coli Proteins / genetics
  • GTP Phosphohydrolases / genetics
  • Genes, Bacterial / genetics*
  • Genetic Variation / genetics*
  • Genome, Bacterial
  • Genomics*
  • Molecular Sequence Data
  • Phylogeny
  • Regulatory Sequences, Nucleic Acid / genetics
  • Salmonella typhimurium / enzymology
  • Salmonella typhimurium / genetics
  • Sequence Alignment
  • Sequence Homology, Nucleic Acid

Substances

  • DNA, Bacterial
  • Escherichia coli Proteins
  • DNA Restriction Enzymes
  • GTP Phosphohydrolases
  • YjiA protein, E coli

Associated data

  • GENBANK/AY392450