Global informatics and physical property selection in protein sequences

Proc Natl Acad Sci U S A. 2016 Feb 16;113(7):1808-10. doi: 10.1073/pnas.1525745113. Epub 2016 Feb 1.

Abstract

The degree of informatic independence between the physical properties of amino acids as encoded in actual protein sequences is calculated. It is shown that no physical property can be identified that carries significantly less information than others and that the information overlap between different properties and different length scales along the sequence is essentially zero. These observations suggest that bioinformatic models based on arbitrarily selected sets of physical properties are inherently deficient.

Keywords: Fourier analysis; information theory; physical properties; protein bioinformatics.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Computational Biology*
  • Fourier Analysis
  • Proteins / chemistry*

Substances

  • Proteins