Evolutionary Characterization of the Short Protein SPAAR

Genes (Basel). 2021 Nov 24;12(12):1864. doi: 10.3390/genes12121864.

Abstract

Microproteins (<100 amino acids) are receiving increasing recognition as important participants in numerous biological processes, but their evolutionary dynamics are poorly understood. SPAAR is a recently discovered microprotein that regulates muscle regeneration and angiogenesis through interactions with conserved signaling pathways. Interestingly, SPAAR does not belong to any known protein family and has known homologs exclusively among placental mammals. This lack of distant homology could be caused by challenges in homology detection of short sequences, or it could indicate a recent de novo emergence from a noncoding sequence. By integrating syntenic alignments and homology searches, we identify SPAAR orthologs in marsupials and monotremes, establishing that SPAAR has existed at least since the emergence of mammals. SPAAR shows substantial primary sequence divergence but retains a conserved protein structure. In primates, we infer two independent evolutionary events leading to the de novo origination of 5' elongated isoforms of SPAAR from a noncoding sequence and find evidence of adaptive evolution in this extended region. Thus, SPAAR may be of ancient origin, but it appears to be experiencing continual evolutionary innovation in mammals.

Keywords: adaptation; comparative genomics; de novo gene birth; gene annotation; homology detection; lncRNAs; mTOR pathway; microproteins; noncanonical translation; protein evolution.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Evolution, Molecular
  • Female
  • Humans
  • Mammals / genetics
  • Mice
  • Opossums / genetics
  • Peptides / genetics*
  • Phylogeny
  • Placenta / metabolism
  • Platypus / genetics
  • Pregnancy
  • Primates / genetics
  • RNA, Long Noncoding / genetics*

Substances

  • Peptides
  • RNA, Long Noncoding
  • SPAR polypeptide, human