Structure based sequence analysis of viral and cellular protein assemblies

J Struct Biol. 2016 Dec;196(3):299-308. doi: 10.1016/j.jsb.2016.07.013. Epub 2016 Jul 29.

Abstract

It is well accepted that, in general, protein structural similarity is strongly related to the amino acid sequence identity. To analyze in great detail the correlation, distribution and variation levels of conserved residues in the protein structure, we analyzed all available high-resolution structural data of 5245 cellular complex-forming proteins and 293 spherical virus capsid proteins (VCPs). We categorized and compare them in terms of protein structural regions. In all cases, the buried core residues are the most conserved, followed by the residues at the protein-protein interfaces. The solvent-exposed surface shows greater sequence variations. Our results provide evidence that cellular monomers and VCPs could be two extremes in the quaternary structural space, with cellular dimers and oligomers in between. Moreover, based on statistical analysis, we detected a distinct group of icosahedral virus families whose capsid proteins seem to evolve much slower than the rest of the protein complexes analyzed in this work.

Keywords: Bioinformatics; Protein complexes; Protein-protein interactions; Sequence conservation; Subunit interface; Viral capsid proteins.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, N.I.H., Extramural

MeSH terms

  • Amino Acid Sequence / genetics
  • Capsid Proteins / ultrastructure*
  • Conserved Sequence*
  • Crystallography, X-Ray
  • Evolution, Molecular
  • Models, Molecular
  • Protein Conformation
  • Sequence Alignment
  • Structural Homology, Protein*
  • Viral Proteins / chemistry*
  • Viruses / genetics

Substances

  • Capsid Proteins
  • Viral Proteins