Determination of single nucleotide variants in Escherichia coli DH5α by using short-read sequencing

FEMS Microbiol Lett. 2015 Jun;362(11):fnv073. doi: 10.1093/femsle/fnv073. Epub 2015 Apr 30.

Abstract

Escherichia coli DH5α is a common laboratory strain that provides an important platform for routine cloning and synthetic biology applications. Many synthetic circuits have been constructed and successfully expressed in E. coli DH5α; however, its genome sequence has not yet been determined. Here, we determined the E. coli DH5α genome sequence by short-read sequencing and identified genetic mutations that affect its phenotypic functions. The sequencing results clearly describe the genotype of E. coli DH5α, which will aid further studies using the strain. Additionally, we observed 105 single nucleotide variants (SNVs) relative to the parental strain E. coli DH1, 83% of which were detected in protein-coding regions. Interestingly, 23% of the protein-coding regions carry mutations in their amino acid residues, and the corresponding biological functions were categorized into two-component systems, peptidoglycan biosynthesis and lipopolysaccharide biosynthesis. These results underscore the advantages of E. coli DH5α, which tolerates the components of transformation buffer and expresses foreign plasmids efficiently. Moreover, these SNVs were also observed in the commercially available strain. These data provide genetic information on E. coli DH5α for its future application in metabolic engineering and synthetic biology.
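The idea of calling SNVs against a parental reference and asking how many fall in protein-coding regions can be illustrated with a minimal sketch. It is not the pipeline used in the study: it assumes two sequences that are already aligned to the same length, and the sequences, gene coordinates and function names (call_snvs, in_coding_region) are hypothetical toy values chosen for illustration.

```python
# Toy illustration only: sequences and gene coordinates are made up,
# not taken from the E. coli DH1 or DH5α genomes.
from typing import List, Tuple

def call_snvs(reference: str, sample: str) -> List[Tuple[int, str, str]]:
    """Return (0-based position, reference base, sample base) for each mismatch."""
    if len(reference) != len(sample):
        raise ValueError("sequences must be aligned to the same length")
    return [(i, r, s) for i, (r, s) in enumerate(zip(reference, sample)) if r != s]

def in_coding_region(pos: int, genes: List[Tuple[int, int]]) -> bool:
    """True if pos lies inside any half-open [start, end) gene interval."""
    return any(start <= pos < end for start, end in genes)

reference = "ATGGCTAAATGACCGTTATGCCC"   # stands in for the parental strain
sample    = "ATGGCTGAATGACCATTATGCAC"   # stands in for the derived strain
genes = [(0, 12), (17, 23)]             # hypothetical coding intervals

snvs = call_snvs(reference, sample)
coding = [v for v in snvs if in_coding_region(v[0], genes)]
coding_fraction = len(coding) / len(snvs)

print(f"{len(snvs)} SNVs, {len(coding)} in coding regions")
```

With real short-read data, variants would instead come from read mapping and a variant caller, and coding regions from a genome annotation; only the counting step resembles the sketch.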

Keywords: Escherichia coli DH5α; genome sequencing; single nucleotide variants.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acids
  • Escherichia coli / genetics*
  • Genetic Variation*
  • Genome, Bacterial*
  • Lipopolysaccharides / genetics
  • Mutation
  • Nucleotides / analysis*
  • Peptidoglycan / genetics
  • Phenotype
  • Plasmids
  • Sequence Analysis, DNA / methods*

Substances

  • Amino Acids
  • Lipopolysaccharides
  • Nucleotides
  • Peptidoglycan