Unravelling the relationship between protein sequence and low-complexity regions entropies: Interactome implications

J Theor Biol. 2015 Oct 7:382:320-7. doi: 10.1016/j.jtbi.2015.06.049. Epub 2015 Jul 8.

Abstract

Low-complexity regions are sub-sequences of biased composition in a protein sequence. The influence of these regions over protein evolution, specific functions and highly interactive capacities is well known. Although protein sequence entropy has been largely studied, its relationship with low-complexity regions and the subsequent effects on protein function remains unclear. In this work we propose a theoretical and empirical model integrating the sequence entropy with local complexity parameters. Our results indicate that the protein sequence entropy is related with the protein length, the entropies inside and outside the low-complexity regions as well as their number and average size. We found a small but significant increment in the sequence entropy of hubs proteins. In agreement with our theoretical model, this increment is highly dependent of the balance between the increment of protein length and average size of the low-complexity regions. Finally, our models and proteins analysis provide evidence supporting that modifications in the average size is more relevant in hubs proteins than changes in the number of low-complexity regions.

Keywords: Hubs; Interactome; Low-complexity-regions; Protein sequence entropy.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Databases, Protein
  • Entropy*
  • Humans
  • Logistic Models
  • Protein Interaction Maps*
  • Proteins / chemistry*
  • Sequence Analysis, Protein

Substances

  • Proteins