Origins of Myc proteins--using intrinsic protein disorder to trace distant relatives

PLoS One. 2013 Sep 24;8(9):e75057. doi: 10.1371/journal.pone.0075057. eCollection 2013.

Abstract

Mammalian Myc proteins are important determinants of cell proliferation as well as the undifferentiated state of stem cells and their activity is frequently deregulated in cancer. Based mainly on conservation in the C-terminal DNA-binding and dimerization domain, Myc-like proteins have been reported in many simpler organisms within and outside the Metazoa but they have not been found in fungi or plants. Several important signature motifs defining mammalian Myc proteins are found in the N-terminal domain but the extent to which these are found in the Myc-like proteins from simpler organisms is not well established. The extent of N-terminal signature sequence conservation would give important insights about the evolution of Myc proteins and their current function in mammalian physiology and disease. In a systematic study of Myc-like proteins we show that N-terminal signature motifs are not readily detectable in individual Myc-like proteins from invertebrates but that weak similarities to Myc boxes 1 and 2 can be found in the N-termini of the simplest Metazoa as well as the unicellular choanoflagellate, Monosiga brevicollis, using multiple protein alignments. Phylogenetic support for the connections of these proteins to established Myc proteins is however poor. We show that the pattern of predicted protein disorder along the length of Myc proteins can be used as a complementary approach to making dendrograms of Myc proteins that aids the classification of Myc proteins. This suggests that the pattern of disorder within Myc proteins is more conserved through evolution than their amino acid sequence. In the disorder-based dendrograms the Myc-like proteins from simpler organisms, including M. brevicollis, are connected to established Myc proteins with a higher degree of certainty. Our results suggest that protein disorder based dendrograms may be of general significance for studying distant relationships between proteins, such as transcription factors, that have high levels of intrinsic disorder.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Motifs
  • Amino Acid Sequence
  • Amino Acid Substitution
  • Conserved Sequence
  • Humans
  • Intrinsically Disordered Proteins / metabolism*
  • Molecular Sequence Data
  • Neoplasms / metabolism
  • Phylogeny*
  • Protein Structure, Tertiary
  • Proto-Oncogene Proteins c-myc / chemistry
  • Proto-Oncogene Proteins c-myc / genetics*
  • Proto-Oncogene Proteins c-myc / metabolism
  • Sequence Alignment

Substances

  • Intrinsically Disordered Proteins
  • Proto-Oncogene Proteins c-myc

Grants and funding

The work was financed by the Swedish Research Council (www.vr.se) and the Swedish Cancer Society (www.cancerfonden.se). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.