Early vertebrate origin of CTCFL, a CTCF paralog, revealed by proximity-guided shark genome scaffolding

Sci Rep. 2020 Sep 3;10(1):14629. doi: 10.1038/s41598-020-71602-w.

Abstract

The nuclear protein CCCTC-binding factor (CTCF) contributes as an insulator to chromatin organization in diverse animals. The gene encoding this protein has a paralog that was first identified as being expressed exclusively in the testis in mammals and was designated CTCFL (also called BORIS). CTCFL orthologs had been reported only among amniotes, and thus CTCFL was once thought to have arisen in the amniote lineage. In this study, we identified elasmobranch CTCFL orthologs and investigated the origin of the gene with the aid of a shark genome assembly improved by proximity-guided scaffolding. Our analysis, based on an evolutionary interpretation of syntenic gene location, suggested that the gene duplication giving rise to CTCF and CTCFL occurred earlier than previously thought, that is, around the common ancestor of extant vertebrates. In addition, our transcriptomic sequencing revealed testis-biased expression of catshark CTCFL, suggesting that the tissue-specific localization observed in mammals originated more than 400 million years ago. To understand the historical process of the functional consolidation of the long-standing chromatin regulator CTCF, the additional paralogs that remain in some of the descendant lineages, with spatially restricted transcript distribution, should be taken into consideration.

MeSH terms

  • Animals
  • CCCTC-Binding Factor / genetics*
  • DNA-Binding Proteins / genetics*
  • Gene Duplication
  • Genome
  • Sharks / genetics*

Substances

  • CCCTC-Binding Factor
  • DNA-Binding Proteins