Divergent evolution profiles of DD37D and DD39D families of Tc1/mariner transposons in eukaryotes

Mol Phylogenet Evol. 2021 Aug:161:107143. doi: 10.1016/j.ympev.2021.107143. Epub 2021 Mar 10.

Abstract

DNA transposons play a significant role in shaping the size and structure of eukaryotic genomes. The Tc1/mariner transposons are the most diverse and widely distributed superfamily of DNA transposons and the structure and distribution of several Tc1/mariner families, such as DD35E/TR, DD36E/IC, DD37E/TRT, and DD41D/VS, have been well studied. Nonetheless, a greater understanding of the structure and diversity of Tc1/mariner transposons will provide insight into the evolutionary history of eukaryotic genomes. Here, we conducted further analysis of DD37D/maT and DD39D (named Guest, GT), which were identified by the specific catalytic domains DD37D and DD39D. Most transposons of the maT family have a total length of approximately 1.3 kb and harbor a single open reading frame encoding a ~ 346 amino acid (range 302-398 aa) transposase protein, flanked by short terminal inverted repeats (TIRs) (13-48 base pairs, bp). In contrast, GTs transposons were longer (2.0-5.8 kb), encoded a transposase protein of ~400 aa (range 140-592 aa), and were flanked by short TIRs (19-41 bp). Several conserved motifs, including two helix-turn-helix (HTH) motifs, a GRPR (GRKR) motif, a nuclear localization sequence, and a DDD domain, were also identified in maT and GT transposases. Phylogenetic analyses of the DDD domain showed that the maT and GT families each belong to a monophyletic clade and appear to be closely related to DD41D/VS and DD34D/mariner. In addition, maTs are mainly distributed in invertebrates (144 species), whereas GTs are mainly distributed in land plants through a small number of GTs are present in Chromista and animals. Sequence identity and phylogenetic analysis revealed that horizontal transfer (HT) events of maT and GT might occur between kingdoms and phyla of eukaryotes; however, pairwise distance comparisons between host genes and transposons indicated that HT events involving maTs might be less frequent between invertebrate species and HT events involving GTs may be less frequent between land plant species. Overall, the DD37D/maT and DD39D/GT families display significantly different distribution and tend to be identified in more ancient evolutionary families. The discovery of intact transposases, perfect TIRs, and target site duplications (TSD) of maTs and GTs illustrates that the DD37D/maT and DD39D/GT families may be active. Together, these findings improve our understanding of the diversity of Tc1/mariner transposons and their impact on eukaryotic genome evolution.

Keywords: DD37D; DD39D; Evolution; Guest; Horizontal transfer; Tc1/mariner; Transposable elements; Transposons; maT.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • DNA Transposable Elements / genetics*
  • Eukaryota / genetics*
  • Evolution, Molecular*
  • Invertebrates / genetics
  • Phylogeny
  • Transposases / genetics*

Substances

  • DNA Transposable Elements
  • Transposases