Genome-wide computational determination of the human metalloproteome

Proteins. 2015 May;83(5):931-9. doi: 10.1002/prot.24790. Epub 2015 Apr 4.

Abstract

Accurate prediction of protein function in humans is important for understanding biological processes at the molecular level in biomedicine and drug design. Over a third of proteins are commonly held to bind metal, and ∼10% of human proteins, to bind zinc. Therefore, an initial step in protein function prediction frequently involves predicting metal ion binding. In recent years, methods have been developed to predict a set of residues in 3D space forming the metal-ion binding site, often with a high degree of accuracy. Here, using extensions of these methods, we provide an extensive list of human proteins and their putative metal ion binding site residues, using translated gene sequences derived from the complete, resolved human genome. Under conditions of ∼90% selectivity, over 900 new human putative metal ion binding proteins are identified. A statistical analysis of resolved metal ion binding sites in the human metalloproteome is furnished and the importance of remote homology analysis is demonstrated. As an example, a novel metal-ion binding site involving a complex of a botulinum substrate with its inhibitor is presented. On the basis of the location of the predicted site and the interactions of the contacting residues at the complex interface, we postulate that metal ion binding in this region could influence complex formation and, consequently, the functioning of the protein. Thus, this work provides testable hypotheses about novel functions of known proteins.

Keywords: botulinum substrate; human proteome; metal binding sites; protein function prediction.

MeSH terms

  • Binding Sites
  • Botulinum Toxins / chemistry
  • Coordination Complexes / chemistry
  • Genome, Human
  • Humans
  • Metalloproteins / chemistry*
  • Metalloproteins / genetics
  • Models, Molecular
  • Molecular Sequence Annotation
  • Protein Structure, Tertiary
  • Sequence Analysis, Protein
  • Software

Substances

  • Coordination Complexes
  • Metalloproteins
  • Botulinum Toxins