Sequence diversity of Bacillus thuringiensis flagellin (H antigen) protein at the intra-H serotype level

Appl Environ Microbiol. 2008 Sep;74(17):5524-32. doi: 10.1128/AEM.00951-08. Epub 2008 Jun 27.

Abstract

In Bacillus thuringiensis, the hag gene encodes flagellin, the protein responsible for eliciting the immunological reaction in H serotyping. Specific flagellin amino acid sequences have been correlated to specific B. thuringiensis H serotypes, H1 to H67. Ten H serotypes, however, contain three or more antigenic subfactors, labeled a, b, c, d, or e, and have been subdivided into 23 serovars. In the present study, we set out to analyze the sequence diversity of flagellins among serovars from the same H serotypes. We studied the hag genes in 39 B. thuringiensis strains representing the 23 serovars from the 10 H serotypes mentioned above. A serovar and a biovar from an 11th H serotype were also included. The hag genes were amplified and cloned and their nucleotide sequences were determined and translated into amino acid sequences, or the sequences were retrieved directly from GenBank when available. Strains of the H3 serotype contained two or three copies of the fla gene, an ortholog of the hag gene. Strains of the H6 serotype contained three copies. Strains of all other H serotypes each contained a single copy of the hag gene. Alignments of amino acid sequences from all copies in all strains of the H3 serotype revealed short signature sequences, GGAG and SGG, GPDPDDAVKNLT, and DITTTK, that appeared to be specific to the H3c, H3d, and H3e antigenic subfactors, respectively. Similar short signature sequences, GDIT, AFIK, TSAGKA, and SAPSKG, were revealed for H8b, H8c, H20b, and H20c, respectively. Amino acid sequences in the flagellin central variable region were highly conserved among serovars of the H3, H5, H11, and H20 serotypes and much more divergent among serovars of the H4, H10, H18, H24, and H28 serotypes. Two bootstrapped neighbor-joining trees were respectively generated from the alignments of the amino acid sequences translated from all copies of the hag genes in the B. thuringiensis strains of the H3 and H6 serotypes. Sequence identities and relationships were revealed. A third bootstrapped neighbor-joining tree was generated, this one from the alignment of the flagellin amino acid sequences from all the B. thuringiensis strains in the study. Eight clusters, I to VIII, were revealed. Although most clusters contained strains and serovars from the same H serotype, clusters VII and VIII contained serovars from different H serotypes.
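
The signature sequences listed in the abstract lend themselves to a simple lookup. The following is a minimal sketch, not the authors' method: it assumes a plain substring search over a translated flagellin sequence is an acceptable first pass, and the function name `find_signatures` and the toy fragment are hypothetical, introduced only for illustration. The motif-to-subfactor mapping is taken from the abstract, which reports the motifs only as appearing to be specific.

```python
# Signature motifs and the antigenic subfactor each is reported to mark,
# as listed in the abstract ("appeared to be specific", so treat as tentative).
SIGNATURES = {
    "H3c": ["GGAG", "SGG"],
    "H3d": ["GPDPDDAVKNLT"],
    "H3e": ["DITTTK"],
    "H8b": ["GDIT"],
    "H8c": ["AFIK"],
    "H20b": ["TSAGKA"],
    "H20c": ["SAPSKG"],
}


def find_signatures(protein_seq):
    """Return {subfactor: [(motif, 0-based position), ...]} for every motif found.

    Assumption: an exact, case-insensitive substring match is used here;
    the study itself identified the motifs from sequence alignments.
    """
    seq = protein_seq.upper()
    hits = {}
    for subfactor, motifs in SIGNATURES.items():
        for motif in motifs:
            start = seq.find(motif)
            while start != -1:
                hits.setdefault(subfactor, []).append((motif, start))
                start = seq.find(motif, start + 1)
    return hits


# Made-up toy fragment for illustration only, NOT a real flagellin sequence.
toy_fragment = "MKLNAAGDITQQSTV"
print(find_signatures(toy_fragment))
```

Very short motifs such as SGG can occur by chance in unrelated proteins, so a hit from this kind of scan would only be suggestive; the abstract reports the motifs in the context of alignments within a given H serotype.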

MeSH terms

  • Amino Acid Sequence
  • Antigenic Variation*
  • Antigens, Bacterial / genetics*
  • Bacillus thuringiensis / classification
  • Bacillus thuringiensis / genetics*
  • Cloning, Molecular
  • DNA, Bacterial / genetics
  • Flagellin / genetics*
  • Genes, Bacterial
  • Genetic Variation
  • Molecular Sequence Data
  • Operon
  • Phylogeny
  • Sequence Alignment
  • Sequence Analysis, Protein
  • Serotyping

Substances

  • Antigens, Bacterial
  • DNA, Bacterial
  • H antigen
  • Flagellin

Associated data

  • GENBANK/EF595771
  • GENBANK/EF595772
  • GENBANK/EF595773
  • GENBANK/EF595774
  • GENBANK/EF595775
  • GENBANK/EF595776
  • GENBANK/EF595777
  • GENBANK/EF595778
  • GENBANK/EF595779
  • GENBANK/EF595780
  • GENBANK/EF595781
  • GENBANK/EF595782
  • GENBANK/EF595783
  • GENBANK/EF595784
  • GENBANK/EF595785
  • GENBANK/EF595786
  • GENBANK/EF595787
  • GENBANK/EF595788
  • GENBANK/EF595789
  • GENBANK/EF595790
  • GENBANK/EF595791