Uncovering transcriptional dark matter via gene annotation independent single-cell RNA sequencing analysis

Nat Commun. 2021 Apr 12;12(1):2158. doi: 10.1038/s41467-021-22496-3.

Abstract

Conventional scRNA-seq expression analyses rely on the availability of a high quality genome annotation. Yet, as we show here with scRNA-seq experiments and analyses spanning human, mouse, chicken, mole rat, lemur and sea urchin, genome annotations are often incomplete, in particular for organisms that are not routinely studied. To overcome this hurdle, we created a scRNA-seq analysis routine that recovers biologically relevant transcriptional activity beyond the scope of the best available genome annotation by performing scRNA-seq analysis on any region in the genome for which transcriptional products are detected. Our tool generates a single-cell expression matrix for all transcriptionally active regions (TARs), performs single-cell TAR expression analysis to identify biologically significant TARs, and then annotates TARs using gene homology analysis. This procedure uses single-cell expression analyses as a filter to direct annotation efforts to biologically significant transcripts and thereby uncovers biology to which scRNA-seq would otherwise be in the dark.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Chick Embryo
  • Gene Expression Regulation
  • Genetic Markers
  • Genome
  • Heart / embryology
  • Humans
  • Molecular Sequence Annotation*
  • RNA, Messenger / genetics
  • RNA, Messenger / metabolism
  • Sequence Analysis, RNA*
  • Single-Cell Analysis*
  • Transcription, Genetic*
  • Transcriptome / genetics

Substances

  • Genetic Markers
  • RNA, Messenger