Identification of RNA 3´ ends and termination sites in Haloferax volcanii

RNA Biol. 2020 May;17(5):663-676. doi: 10.1080/15476286.2020.1723328. Epub 2020 Feb 10.

Abstract

Archaeal genomes are densely packed; thus, correct transcription termination is an important factor for orchestrated gene expression. A systematic analysis of RNA 3´ termini, to identify transcription termination sites (TTS) using RNAseq data has hitherto only been performed in two archaea, Methanosarcina mazei and Sulfolobus acidocaldarius. In this study, only regions directly downstream of annotated genes were analysed, and thus, only part of the genome had been investigated. Here, we developed a novel algorithm (Internal Enrichment-Peak Calling) that allows an unbiased, genome-wide identification of RNA 3´ termini independent of annotation. In an RNA fraction enriched for primary transcripts by terminator exonuclease (TEX) treatment we identified 1,543 RNA 3´ termini. Approximately half of these were located in intergenic regions, and the remainder were found in coding regions. A strong sequence signature consistent with known termination events at intergenic loci indicates a clear enrichment for native TTS among them. Using these data we determined distinct putative termination motifs for intergenic (a T stretch) and coding regions (AGATC). In vivo reporter gene tests of selected TTS confirmed termination at these sites, which exemplify the different motifs. For several genes, more than one termination site was detected, resulting in transcripts with different lengths of the 3´ untranslated region (3´ UTR).

Keywords: 3´ UTR; Haloarchaea; Haloferax volcanii; RNA 3´ ends; RNAseq; Transcription termination; archaea.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • 3' Untranslated Regions*
  • Algorithms
  • Cluster Analysis
  • Computational Biology / methods
  • Gene Expression Regulation, Archaeal*
  • Genome, Archaeal
  • Genomics / methods
  • Haloferax volcanii / genetics*
  • Molecular Sequence Annotation
  • Nucleotide Motifs
  • Open Reading Frames
  • Operon
  • RNA, Archaeal / genetics*
  • Transcription Termination, Genetic

Substances

  • 3' Untranslated Regions
  • RNA, Archaeal

Grants and funding

This work was supported by the Deutsche Forschungsgemeinschaft [MA1538/21-1] (AM) and by the Austrian Science Fund [SFB F43] (FA).