How is structural divergence related to evolutionary information?

Mol Phylogenet Evol. 2018 Oct:127:859-866. doi: 10.1016/j.ympev.2018.06.033. Epub 2018 Jun 25.

Abstract

The analysis of evolutionary information in a protein family, such as conservation and covariation, is often linked to its structural information. Multiple sequence alignments of distant homologous sequences are used to measure evolutionary variables. Although high structural differences between proteins can be expected in such divergent alignments, most works linking evolutionary and structural information use a single structure ignoring the structural variability within protein families. The goal of this work is to elucidate the relevance of structural divergence when sequence-based measures are integrated with structural information. We found that inter-residue contacts and solvent accessibility undergo large variations in protein families. Our results show that high covariation scores tend to reveal residue contacts that are conserved in the family, instead of protein or conformer specific contacts. We also found that residue accessible surface area shows a high variability between structures of the same family. As a consequence, the mean relative solvent accessibility of multiple structures correlates better with the conservation pattern than the relative solvent accessibility of a single structure. We conclude that the use of comprehensive structural information allows a more accurate interpretation of the information computed from sequence alignments. Therefore, considering structural divergence would lead to a better understanding of protein function, dynamics, and evolution.

Keywords: Coevolution; Conformational diversity; Conservation; Solvent accessibility; Structural divergence.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Amino Acids / genetics
  • Area Under Curve
  • Conserved Sequence / genetics
  • Evolution, Molecular*
  • Phylogeny
  • Protein Domains
  • Protein Kinases / chemistry
  • Proteins / chemistry*
  • Proteins / genetics*
  • Sequence Alignment
  • Solvents
  • Statistics, Nonparametric

Substances

  • Amino Acids
  • Proteins
  • Solvents
  • Protein Kinases