AlphaFold2-guided description of CoBaHMA, a novel family of bacterial domains within the heavy-metal-associated superfamily

Proteins. 2024 Jun;92(6):776-794. doi: 10.1002/prot.26668. Epub 2024 Jan 22.

Abstract

Three-dimensional (3D) structure information, now available at the proteome scale, may facilitate the detection of remote evolutionary relationships in protein superfamilies. Here, we illustrate this with the identification of a novel family of protein domains related to the ferredoxin-like superfold, by combining (i) transitive sequence similarity searches, (ii) clustering approaches, and (iii) the use of AlphaFold2 3D structure models. Domains of this family were initially identified in relation with the intracellular biomineralization of calcium carbonates by Cyanobacteria. They are part of the large heavy-metal-associated (HMA) superfamily, departing from the latter by specific sequence and structural features. In particular, most of them share conserved basic amino acids (hence their name CoBaHMA for Conserved Basic residues HMA), forming a positively charged surface, which is likely to interact with anionic partners. CoBaHMA domains are found in diverse modular organizations in bacteria, existing in the form of monodomain proteins or as part of larger proteins, some of which are membrane proteins involved in transport or lipid metabolism. This suggests that the CoBaHMA domains may exert a regulatory function, involving interactions with anionic lipids. This hypothesis might have a particular resonance in the context of the compartmentalization observed for cyanobacterial intracellular calcium carbonates.

Keywords: ABC transporter; AlphaFold2; CoBaHMA; P1B‐ATPase; PAP2; biomineralization; functional annotation; graph clustering; heavy‐metal‐associated; modular organization; sequence similarity search.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence*
  • Bacterial Proteins* / chemistry
  • Bacterial Proteins* / genetics
  • Bacterial Proteins* / metabolism
  • Cyanobacteria / chemistry
  • Cyanobacteria / genetics
  • Cyanobacteria / metabolism
  • Ferredoxins / chemistry
  • Ferredoxins / metabolism
  • Metals, Heavy* / chemistry
  • Metals, Heavy* / metabolism
  • Models, Molecular*
  • Protein Domains
  • Protein Folding

Substances

  • Bacterial Proteins
  • Metals, Heavy
  • Ferredoxins