Identifying folding nucleus based on residue contact networks of proteins

Proteins. 2008 Jun;71(4):1899-907. doi: 10.1002/prot.21891.

Abstract

In the native structure of a protein, all the residues are tightly parked together in a specific order following its folding and every residue contacts with some spatially neighbor residues. A residue contact network can be constructed by defining the residues as nodes and the native contacts as edges. During the folding of small single-domain proteins, there is a set of contacts (or bonds), defined as the folding nucleus (FN), which is formed around the transition state, i.e., a rate-limiting barrier located at about the middle between the unfolded states and the native state on the free energy landscape. Such a FN plays an essential role in the folding dynamics and the residues, which form the related contacts called as folding nucleus residues (FNRs). In this work, the FNRs in proteins are identified by using quantities which characterize the topology of residue contact networks of proteins. By comparing the specificities of residues with the network quantities K(R), L(R), and D(R), up to 90% FNRs of six typical proteins found experimentally are identified. It is found that the FNRs behave the full-closeness centrals rather than degree or closeness centers in the residue contact network, implying that they are important to the folding cooperativity of proteins. Our study shows that the FNRs can be identified solely from the native structures of proteins based on the analysis of residue contact network without any knowledge of the transition state ensemble.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Amino Acids / chemistry
  • Hydrogen Bonding
  • Hydrophobic and Hydrophilic Interactions
  • Models, Molecular
  • Molecular Sequence Data
  • Mutation
  • Protein Conformation
  • Protein Folding*
  • Protein Structure, Secondary
  • Proteins / chemistry*
  • Proteins / genetics
  • Sensitivity and Specificity
  • Thermodynamics

Substances

  • Amino Acids
  • Proteins