Comparative genomics of microsporidian genomes reveals a minimal non-coding RNA set and new insights for transcription in minimal eukaryotic genomes

DNA Res. 2017 Jun 1;24(3):251-260. doi: 10.1093/dnares/dsx002.

Abstract

Microsporidia are ubiquitous intracellular pathogens whose opportunistic nature led to their increased recognition with the rise of the AIDS pandemic. As the RNA world was largely unexplored in this parasitic lineage, we developed a dedicated in silico methodology to carry out exhaustive identification of ncRNAs across the Encephalitozoon and Nosema genera. Thus, the previously missing U1 small nuclear RNA (snRNA) and small nucleolar RNAs (snoRNAs) targeting only the LSU rRNA were highlighted and were further validated using 5' and 3'RACE-PCR experiments. Overall, the 15 ncRNAs that were found shared between Encephalitozoon and Nosema spp. may represent the minimal core set required for parasitic life. Interestingly, the systematic presence of a CCC- or GGG-like motif in 5' of all ncRNA and mRNA gene transcripts regardless of the RNA polymerase involved suggests that the RNA polymerase machineries in microsporidia species could use common factors. Our data provide additional insights in accordance with the simplification processes observed in these reduce genomes and underline the usefulness of sequencing closely related species to help identify highly divergent ncRNAs in these parasites.

Keywords: Microsporidia; genome evolution; ncRNA prediction; ncRNA structure; transcriptional regulation signals.

MeSH terms

  • Base Sequence
  • Computer Simulation
  • Encephalitozoon / genetics*
  • Genome, Fungal*
  • Genomics
  • Nosema / genetics*
  • RNA, Small Nuclear / metabolism
  • RNA, Small Nucleolar / metabolism
  • RNA, Untranslated / metabolism*
  • Transcription, Genetic*

Substances

  • RNA, Small Nuclear
  • RNA, Small Nucleolar
  • RNA, Untranslated
  • U1 small nuclear RNA