Long first exons and epigenetic marks distinguish conserved pachytene piRNA clusters from other mammalian genes

Nat Commun. 2021 Jan 4;12(1):73. doi: 10.1038/s41467-020-20345-3.

Abstract

In the male germ cells of placental mammals, 26-30-nt-long PIWI-interacting RNAs (piRNAs) emerge when spermatocytes enter the pachytene phase of meiosis. In mice, pachytene piRNAs derive from ~100 discrete autosomal loci that produce canonical RNA polymerase II transcripts. Transcripts from these piRNA clusters bear 5' caps and 3' poly(A) tails, and often contain introns that are removed before nuclear export and processing into piRNAs. What marks pachytene piRNA clusters to produce piRNAs, and what confines their expression to the germline? We report that an unusually long first exon (≥ 10 kb) or a long, unspliced transcript correlates with germline-specific transcription and piRNA production. Our integrative analysis of transcriptome, piRNA, and epigenome datasets across multiple species reveals that a long first exon is an evolutionarily conserved feature of pachytene piRNA clusters. Furthermore, a highly methylated promoter, often containing a low or intermediate level of CG dinucleotides, correlates with germline expression and somatic silencing of pachytene piRNA clusters. Pachytene piRNA precursor transcripts bind THOC1 and THOC2, THO complex subunits known to promote transcriptional elongation and mRNA nuclear export. Together, these features may explain why the major sources of pachytene piRNAs specifically generate these unique small RNAs in the male germline of placental mammals.
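
The two sequence-level features named in the abstract, first-exon length and promoter CG content, can be illustrated with a minimal Python sketch. This is not the authors' pipeline. The 10 kb cutoff comes from the abstract; the function names, the 0-based half-open coordinate convention, and the use of the observed/expected CpG ratio as the measure of CG content are assumptions made here for illustration only.

```python
from typing import List, Tuple

# Assumed convention: exons are 0-based, half-open (start, end) genomic intervals.
Exon = Tuple[int, int]


def first_exon_length(exons: List[Exon], strand: str) -> int:
    """Length of the transcript's first (5'-most) exon.

    On the '+' strand the first exon has the smallest start coordinate;
    on the '-' strand it has the largest.
    """
    if not exons:
        raise ValueError("transcript has no exons")
    if strand not in ("+", "-"):
        raise ValueError("strand must be '+' or '-'")
    ordered = sorted(exons)
    start, end = ordered[0] if strand == "+" else ordered[-1]
    return end - start


def has_long_first_exon(exons: List[Exon], strand: str, min_len: int = 10_000) -> bool:
    """True if the first exon is at least min_len bp (10 kb, as in the abstract)."""
    return first_exon_length(exons, strand) >= min_len


def cpg_obs_exp(seq: str) -> float:
    """Observed/expected CpG ratio: (#CG * N) / (#C * #G).

    Assumption: this commonly used ratio stands in for the 'level of CG
    dinucleotides'; the abstract does not say which measure was used.
    """
    seq = seq.upper()
    n, c, g = len(seq), seq.count("C"), seq.count("G")
    if n == 0 or c == 0 or g == 0:
        return 0.0
    return seq.count("CG") * n / (c * g)


# Hypothetical two-exon transcript with a 12 kb first exon on the '+' strand.
example_exons = [(100, 12_100), (15_000, 15_200)]
print(has_long_first_exon(example_exons, "+"))  # True
print(has_long_first_exon(example_exons, "-"))  # False: the first exon is the 200 bp one
```

Any cutoff separating low, intermediate, and high CG promoters would have to be taken from the full paper or chosen by the analyst; the sketch deliberately returns the raw ratio rather than a class label.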

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • 5-Methylcytosine / analogs & derivatives
  • 5-Methylcytosine / metabolism
  • Acetylation
  • Animals
  • DNA Methylation / genetics
  • DNA-Binding Proteins / metabolism
  • Epigenesis, Genetic*
  • Evolution, Molecular
  • Exons / genetics*
  • Histones / metabolism
  • Introns / genetics
  • Male
  • Mammals / genetics*
  • Mice
  • Mice, Inbred C57BL
  • Nuclear Proteins / metabolism
  • Organ Specificity / genetics
  • Pachytene Stage / genetics*
  • Promoter Regions, Genetic / genetics
  • RNA Splicing / genetics
  • RNA, Messenger / genetics
  • RNA, Messenger / metabolism
  • RNA, Small Interfering / metabolism*
  • RNA-Binding Proteins / metabolism
  • Signal Transduction / genetics
  • Testis / metabolism
  • Transcription, Genetic

Substances

  • BTBD18 protein, mouse
  • DNA-Binding Proteins
  • Histones
  • Nuclear Proteins
  • RNA, Messenger
  • RNA, Small Interfering
  • RNA-Binding Proteins
  • Thoc1 protein, mouse
  • 5-hydroxymethylcytosine
  • 5-Methylcytosine