A genome-wide map of hyper-edited RNA reveals numerous new sites

Nat Commun. 2014 Aug 27:5:4726. doi: 10.1038/ncomms5726.

Abstract

Adenosine-to-inosine editing is one of the most frequent post-transcriptional modifications, manifested as A-to-G mismatches when comparing RNA sequences with their source DNA. Recently, a number of RNA-seq data sets have been screened for the presence of A-to-G editing, and hundreds of thousands of editing sites identified. Here we show that existing screens missed the majority of sites by ignoring reads with excessive ('hyper') editing that do not easily align to the genome. We show that careful alignment and examination of the unmapped reads in RNA-seq studies reveal numerous new sites, usually many more than originally discovered, and in precisely those regions that are most heavily edited. Specifically, we discover 327,096 new editing sites in the heavily studied Illumina Human BodyMap data and more than double the number of detected sites in several published screens. We also identify thousands of new sites in mouse, rat, opossum and fly. Our results establish that hyper-editing events account for the majority of editing sites.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adenosine / genetics
  • Adenosine Deaminase / genetics
  • Animals
  • Brain / physiology
  • Genome, Human*
  • Humans
  • Inosine / genetics
  • Mice
  • MicroRNAs / genetics
  • Opossums / genetics
  • Platypus / genetics
  • RNA
  • RNA Editing*
  • RNA, Double-Stranded / chemistry
  • RNA, Double-Stranded / genetics
  • RNA-Binding Proteins / genetics
  • Rats
  • Sequence Analysis, RNA / methods*

Substances

  • MicroRNAs
  • RNA, Double-Stranded
  • RNA-Binding Proteins
  • Inosine
  • RNA
  • ADAR protein, human
  • ADARB1 protein, human
  • Adenosine Deaminase
  • Adenosine