A six-amino-acid motif is a major determinant in functional evolution of HOX1 proteins

Genes Dev. 2020 Dec 1;34(23-24):1680-1696. doi: 10.1101/gad.342329.120. Epub 2020 Nov 12.

Abstract

Gene duplication and divergence is a major driver in the emergence of evolutionary novelties. How variations in amino acid sequences lead to loss of ancestral activity and functional diversification of proteins is poorly understood. We used cross-species functional analysis of Drosophila Labial and its mouse HOX1 orthologs (HOXA1, HOXB1, and HOXD1) as a paradigm to address this issue. Mouse HOX1 proteins display low (30%) sequence similarity with Drosophila Labial. However, substituting endogenous Labial with the mouse proteins revealed that HOXA1 has retained essential ancestral functions of Labial, while HOXB1 and HOXD1 have diverged. Genome-wide analysis demonstrated similar DNA-binding patterns of HOXA1 and Labial in mouse cells, while HOXB1 binds to distinct targets. Compared with HOXB1, HOXA1 shows an enrichment in co-occupancy with PBX proteins on target sites and exists in the same complex with PBX on chromatin. Functional analysis of HOXA1-HOXB1 chimeric proteins uncovered a novel six-amino-acid C-terminal motif (CTM) flanking the homeodomain that serves as a major determinant of ancestral activity. In vitro DNA-binding experiments and structural prediction show that CTM provides an important domain for interaction of HOXA1 proteins with PBX. Our findings show that small changes outside of highly conserved DNA-binding regions can lead to profound changes in protein function.

Keywords: Drosophila; Labial; TALE proteins; functional diversification; gene duplication; mouse Hox genes; protein evolution.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Motifs / genetics*
  • Animals
  • Drosophila Proteins / genetics*
  • Drosophila melanogaster / classification
  • Drosophila melanogaster / genetics
  • Evolution, Molecular*
  • Genome-Wide Association Study
  • Homeodomain Proteins / genetics*
  • Mice
  • Models, Molecular
  • Protein Binding / genetics
  • Protein Domains
  • Structure-Activity Relationship

Substances

  • Drosophila Proteins
  • Homeodomain Proteins
  • lab protein, Drosophila