Ancient Evolutionary Origin of Intrinsically Disordered Cancer Risk Regions

Biomolecules. 2020 Jul 28;10(8):1115. doi: 10.3390/biom10081115.

Abstract

Cancer is a heterogeneous genetic disease that alters the proper functioning of proteins involved in key regulatory processes such as cell cycle, DNA repair, survival, or apoptosis. Mutations often accumulate in hot-spots regions, highlighting critical functional modules within these proteins that need to be altered, amplified, or abolished for tumor formation. Recent evidence suggests that these mutational hotspots can correspond not only to globular domains, but also to intrinsically disordered regions (IDRs), which play a significant role in a subset of cancer types. IDRs have distinct functional properties that originate from their inherent flexibility. Generally, they correspond to more recent evolutionary inventions and show larger sequence variations across species. In this work, we analyzed the evolutionary origin of disordered regions that are specifically targeted in cancer. Surprisingly, the majority of these disordered cancer risk regions showed remarkable conservation with ancient evolutionary origin, stemming from the earliest multicellular animals or even beyond. Nevertheless, we encountered several examples where the mutated region emerged at a later stage compared with the origin of the gene family. We also showed the cancer risk regions become quickly fixated after their emergence, but evolution continues to tinker with their genes with novel regulatory elements introduced even at the level of humans. Our concise analysis provides a much clearer picture of the emergence of key regulatory elements in proteins and highlights the importance of taking into account the modular organisation of proteins for the analyses of evolutionary origin.

Keywords: de novo; evolutionary origin; gene duplications; intrinsically disordered regions; linear motifs.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Motifs
  • Amino Acid Sequence
  • Animals
  • Evolution, Molecular
  • Gene Duplication
  • Humans
  • Intrinsically Disordered Proteins / chemistry
  • Intrinsically Disordered Proteins / genetics*
  • Mutation
  • Neoplasms / genetics*
  • Protein Conformation
  • Protein Domains

Substances

  • Intrinsically Disordered Proteins