High-resolution mapping of protein sequence-function relationships

Nat Methods. 2010 Sep;7(9):741-6. doi: 10.1038/nmeth.1492. Epub 2010 Aug 15.

Abstract

We present a large-scale approach to investigate the functional consequences of sequence variation in a protein. The approach entails the display of hundreds of thousands of protein variants, moderate selection for activity and high-throughput DNA sequencing to quantify the performance of each variant. Using this strategy, we tracked the performance of >600,000 variants of a human WW domain after three and six rounds of selection by phage display for binding to its peptide ligand. Binding properties of these variants defined a high-resolution map of mutational preference across the WW domain; each position had unique features that could not be captured by a few representative mutations. Our approach could be applied to many in vitro or in vivo protein assays, providing a general means for understanding how protein function relates to sequence.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • DNA / genetics
  • Databases, Nucleic Acid
  • High-Throughput Screening Assays / methods*
  • Humans
  • Peptide Library
  • Protein Array Analysis / methods*
  • Proteins / chemistry*
  • Proteins / genetics
  • Proteins / metabolism*
  • Sequence Analysis, DNA
  • Structure-Activity Relationship

Substances

  • Peptide Library
  • Proteins
  • DNA