A single unidirectional piRNA cluster similar to the flamenco locus is the major source of EVE-derived transcription and small RNAs in Aedes aegypti mosquitoes

RNA. 2020 May;26(5):581-594. doi: 10.1261/rna.073965.119. Epub 2020 Jan 29.

Abstract

Endogenous viral elements (EVEs) are found in many eukaryotic genomes. Despite considerable knowledge about genomic elements such as transposons (TEs) and retroviruses, we still lack information about nonretroviral EVEs. Aedes aegypti mosquitoes have a highly repetitive genome that is covered with EVEs. Here, we identified 129 nonretroviral EVEs in the AaegL5 version of the A. aegypti genome. These EVEs were significantly associated with TEs and preferentially located in repeat-rich clusters within intergenic regions. Genome-wide transcriptome analysis showed that most EVEs generated transcripts, although only around 1.4% were sense RNAs. The majority of EVE transcription was antisense and correlated with the generation of EVE-derived small RNAs. A single genomic cluster of EVEs located in a 143 kb repetitive region in chromosome 2 contributed 42% of antisense transcription and 45% of small RNAs derived from viral elements. This region was enriched for TE-EVE hybrids organized in the same coding strand. These generated a single long antisense transcript that correlated with the generation of phased primary PIWI-interacting RNAs (piRNAs). The putative promoter of this region had a conserved binding site for the transcription factor Cubitus interruptus, a key regulator of the flamenco locus in Drosophila melanogaster. Here, we have identified a single unidirectional piRNA cluster in the A. aegypti genome that is the major source of EVE transcription fueling the generation of antisense small RNAs in mosquitoes. We propose that this region is a flamenco-like locus in A. aegypti due to its relatedness to the major unidirectional piRNA cluster in Drosophila melanogaster.

Keywords: A. aegypti; EVE; RNA interference; endogenous viral elements; flamenco locus; piRNAs.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Aedes / genetics*
  • Animals
  • Binding Sites / genetics
  • Cadherins / genetics
  • Culicidae / genetics
  • DNA-Binding Proteins / genetics
  • Drosophila Proteins / genetics
  • Drosophila melanogaster / genetics
  • Genome, Insect / genetics*
  • Homeodomain Proteins / genetics
  • Promoter Regions, Genetic
  • RNA, Small Interfering / genetics*
  • Retroelements / genetics*
  • Transcription Factors / genetics

Substances

  • Cadherins
  • DNA-Binding Proteins
  • Drosophila Proteins
  • Homeodomain Proteins
  • RNA, Small Interfering
  • Retroelements
  • Transcription Factors
  • ci protein, Drosophila
  • eve protein, Drosophila
  • stan protein, Drosophila