How Common Is Disorder? Occurrence of Disordered Residues in Four Domains of Life

Int J Mol Sci. 2015 Aug 18;16(8):19490-507. doi: 10.3390/ijms160819490.

Abstract

Disordered regions play important roles in protein adaptation to challenging environmental conditions. Flexible and disordered residues have the highest propensities to alter the protein packing. Therefore, identification of disordered/flexible regions is important for structural and functional analysis of proteins. We used the IsUnstruct program to predict the ordered or disordered status of residues in 122 proteomes, including 97 eukaryotic and 25 large bacterial proteomes larger than 2,500,000 residues. We found that bacterial and eukaryotic proteomes contain comparable fractions of disordered residues: 0.31 in the bacterial and 0.38 in the eukaryotic proteomes. Additional analysis of a total of 1540 bacterial proteomes of various sizes yielded a smaller fraction of disordered residues, only 0.26. Together, the results showed that the larger the proteome, the larger the fraction of disordered residues. A continuous dependence of the fraction of disordered residues on the size of the proteome is observed for four domains of life: Eukaryota, Bacteria, Archaea, and Viruses. Furthermore, our analysis of the 122 proteomes showed that the fraction of disordered residues increased with increasing homo-repeat length for polar, charged, and small residues, and decreased for hydrophobic residues. The maximal fraction of disordered residues was obtained for proteins containing lysine and arginine homo-repeats. The minimal fraction was found for valine and leucine homo-repeats. For 15-residue-long homo-repeats these values were 0.2 (for Val and Leu) and 0.7 (for Lys and Arg).
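
The two quantities discussed in the abstract, the fraction of disordered residues in a proteome and the location of homo-repeats in a sequence, can be illustrated with a minimal Python sketch. This is not the authors' pipeline and does not call IsUnstruct. It assumes, purely for illustration, that per-residue predictions are already available as strings of 'D' (disordered) and 'O' (ordered), one string per protein. The function names `disordered_fraction` and `find_homo_repeats` are hypothetical.

```python
import re
from typing import Iterable, List, Tuple


def disordered_fraction(labels_per_protein: Iterable[str]) -> float:
    """Pooled fraction of residues labeled 'D' over all proteins.

    Assumption: each string holds one label per residue,
    'D' for disordered and 'O' for ordered.
    """
    total = 0
    disordered = 0
    for labels in labels_per_protein:
        total += len(labels)
        disordered += labels.count("D")
    if total == 0:
        raise ValueError("no residues to count")
    return disordered / total


def find_homo_repeats(sequence: str, min_length: int) -> List[Tuple[str, int, int]]:
    """Return (residue, start index, run length) for every run of a single
    amino acid that is at least min_length residues long."""
    if min_length < 1:
        raise ValueError("min_length must be at least 1")
    # One letter followed by at least (min_length - 1) copies of itself.
    pattern = re.compile(r"([A-Z])\1{%d,}" % (min_length - 1))
    return [(m.group(1), m.start(), len(m.group(0)))
            for m in pattern.finditer(sequence)]


if __name__ == "__main__":
    # Toy inputs, not data from the paper.
    print(disordered_fraction(["DDOO", "OOOOOD"]))
    print(find_homo_repeats("MKKKKKAVVL", 3))
```

Pooling residues across all proteins, rather than averaging per-protein fractions, is a choice made for this sketch; the abstract does not state which convention the authors used.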

Keywords: computational prediction; disordered regions; homo-repeats; proteome.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Bacteria / chemistry
  • Bacterial Proteins / chemistry
  • Databases, Protein
  • Genomics / methods*
  • Humans
  • Intrinsically Disordered Proteins / chemistry*
  • Molecular Sequence Data
  • Protein Conformation
  • Protein Folding
  • Proteins / chemistry*
  • Proteome / chemistry

Substances

  • Bacterial Proteins
  • Intrinsically Disordered Proteins
  • Proteins
  • Proteome