Coevolution-based inference of amino acid interactions underlying protein function

Elife. 2018 Jul 20:7:e34300. doi: 10.7554/eLife.34300.

Abstract

Protein function arises from a poorly understood pattern of energetic interactions between amino acid residues. Sequence-based strategies for deducing this pattern have been proposed, but lack of benchmark data has limited experimental verification. Here, we extend deep-mutation technologies to enable measurement of many thousands of pairwise amino acid couplings in several homologs of a protein family - a deep coupling scan (DCS). The data show that cooperative interactions between residues are loaded in a sparse, evolutionarily conserved, spatially contiguous network of amino acids. The pattern of amino acid coupling is quantitatively captured in the coevolution of amino acid positions, especially as indicated by the statistical coupling analysis (SCA), providing experimental confirmation of the key tenets of this method. This work exposes the collective nature of physical constraints on protein function and clarifies its link with sequence analysis, enabling a general practical approach for understanding the structural basis for protein function.

Keywords: E. coli; binding; biochemistry; chemical biology; coevolution; computational biology; cooperativity; epistasis; evolution; mutagenesis; systems biology.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acids / genetics*
  • Conserved Sequence
  • Evolution, Molecular*
  • Ligands
  • Models, Biological
  • Mutation / genetics
  • PDZ Domains
  • Protein Structure, Secondary
  • Proteins / chemistry
  • Proteins / metabolism*
  • Reproducibility of Results
  • Structural Homology, Protein
  • Thermodynamics
  • Two-Hybrid System Techniques

Substances

  • Amino Acids
  • Ligands
  • Proteins