Reproducible features of small RNAs in C. elegans reveal NU RNAs and provide insights into 22G RNAs and 26G RNAs

RNA. 2016 Feb;22(2):184-92. doi: 10.1261/rna.054551.115. Epub 2015 Dec 8.

Abstract

Small RNAs regulate gene expression and most genes in the worm Caenorhabditis elegans are subject to their regulation. Here, we analyze small RNA data sets and use reproducible features of RNAs present in multiple data sets to discover a new class of small RNAs and to reveal insights into two known classes of small RNAs--22G RNAs and 26G RNAs. We found that reproducibly detected 22-nt RNAs, although are predominantly RNAs with a G at the 5' end, also include RNAs with A, C, or U at the 5' end. These RNAs are synthesized downstream from characteristic sequence motifs on mRNA and have U-tailed derivatives. Analysis of 26G RNAs revealed that they are processed from a blunt end of double-stranded RNAs and that production of one 26G RNA generates a hotspot immediately downstream for production of another. To our surprise, analysis of RNAs shorter than 18 nt revealed a new class of RNAs, which we call NU RNAs (pronounced "new RNAs") because they have a NU bias at the 5' end, where N is any nucleotide. NU RNAs are antisense to genes and originate downstream from U bases on mRNA. Although many genes have complementary NU RNAs, their genome-wide distribution is distinct from that of previously known classes of small RNAs. Our results suggest that current approaches underestimate reproducibly detected RNAs that are shorter than 18 nt, and theoretical considerations suggest that such shorter RNAs could be used for sequence-specific gene regulation in organisms like C. elegans that have small genomes.

Keywords: RNA-seq; RNAi; gene silencing.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Animals
  • Caenorhabditis elegans / genetics*
  • Caenorhabditis elegans / metabolism
  • Caenorhabditis elegans Proteins / genetics
  • Caenorhabditis elegans Proteins / metabolism
  • Gene Silencing*
  • Molecular Sequence Data
  • Nucleotide Motifs
  • RNA, Double-Stranded / genetics*
  • RNA, Double-Stranded / metabolism
  • RNA, Small Interfering / chemistry
  • RNA, Small Interfering / genetics*
  • RNA, Small Interfering / metabolism
  • RNA-Dependent RNA Polymerase / genetics
  • RNA-Dependent RNA Polymerase / metabolism
  • Ribonuclease III / genetics
  • Ribonuclease III / metabolism

Substances

  • Caenorhabditis elegans Proteins
  • RNA, Double-Stranded
  • RNA, Small Interfering
  • RNA-Dependent RNA Polymerase
  • RNA-directed RNA polymerase RRF-3, C elegans
  • dcr-1 protein, C elegans
  • Ribonuclease III