Integron gene cassettes: a repository of novel protein folds with distinct interaction sites

PLoS One. 2013;8(1):e52934. doi: 10.1371/journal.pone.0052934. Epub 2013 Jan 18.

Abstract

Mobile gene cassettes captured within integron arrays encompass a vast and diverse pool of genetic novelty. In most cases, functional annotation of gene cassettes directly recovered by cassette-PCR is obscured by their characteristically high sequence novelty. This inhibits identification of those specific functions or biological features that might constitute preferential factors for lateral gene transfer via the integron system. A structural genomics approach incorporating x-ray crystallography has been utilised on a selection of cassettes to investigate evolutionary relationships hidden at the sequence level. Gene cassettes were accessed from marine sediments (pristine and contaminated sites), as well as a range of Vibrio spp. We present six crystal structures, a remarkably high proportion of our survey of soluble proteins, which were found to possess novel folds. These entirely new structures are diverse, encompassing all-α, α+β and α/β fold classes, and many contain clear binding pocket features for small molecule substrates. The new structures emphasise the large repertoire of protein families encoded within the integron cassette metagenome and which remain to be characterised. Oligomeric association is a notable recurring property common to these new integron-derived proteins. In some cases, the protein-protein contact sites utilised in homomeric assembly could instead form suitable contact points for heterogeneous regulator/activator proteins or domains. Such functional features are ideal for a flexible molecular componentry needed to ensure responsive and adaptive bacterial functions.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Bacterial Proteins / chemistry
  • Bacterial Proteins / genetics*
  • Bacterial Proteins / metabolism*
  • Binding Sites
  • Crystallography, X-Ray
  • Gene Transfer, Horizontal / genetics
  • Genes, Bacterial / genetics*
  • Integrons / genetics*
  • Metagenome / genetics
  • Models, Molecular
  • Molecular Sequence Data
  • Oligonucleotide Array Sequence Analysis
  • Protein Binding
  • Protein Structure, Secondary
  • Vibrio cholerae / genetics
  • Vibrio cholerae / metabolism

Substances

  • Bacterial Proteins