Information theoretic measures for quantifying sequence-ensemble relationships of intrinsically disordered proteins

Protein Eng Des Sel. 2019 Dec 31;32(4):191-202. doi: 10.1093/protein/gzz014.

Abstract

Intrinsically disordered proteins (IDPs) contribute to a multitude of functions. De novo design of IDPs should open the door to modulating functions and phenotypes controlled by these systems. Recent design efforts have focused on compositional biases and specific sequence patterns as the design features. Analysis of the impact of these designs on sequence-function relationships indicates that individual sequence/compositional parameters are insufficient for describing sequence-function relationships in IDPs. To remedy this problem, we have developed information theoretic measures for sequence-ensemble relationships (SERs) of IDPs. These measures rely on prior availability of statistically robust conformational ensembles derived from all atom simulations. We show that the measures we have developed are useful for comparing sequence-ensemble relationships even when sequence is poorly conserved. Based on our results, we propose that de novo designs of IDPs, guided by knowledge of their SERs, should provide improved insights into their sequence-ensemble-function relationships.

Keywords: computations; ensemble entropy matrix; intrinsically disordered proteins; protein design; sequence-ensemble relationships.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms
  • Entropy
  • Intrinsically Disordered Proteins / chemistry*
  • Molecular Dynamics Simulation
  • Monte Carlo Method
  • Protein Conformation

Substances

  • Intrinsically Disordered Proteins