Discrete distributional differential expression (D3E)--a tool for gene expression analysis of single-cell RNA-seq data

BMC Bioinformatics. 2016 Feb 29:17:110. doi: 10.1186/s12859-016-0944-6.

Abstract

Background: The advent of high throughput RNA-seq at the single-cell level has opened up new opportunities to elucidate the heterogeneity of gene expression. One of the most widespread applications of RNA-seq is to identify genes which are differentially expressed between two experimental conditions.

Results: We present a discrete, distributional method for differential gene expression (D(3)E), a novel algorithm specifically designed for single-cell RNA-seq data. We use synthetic data to evaluate D(3)E, demonstrating that it can detect changes in expression, even when the mean level remains unchanged. Since D(3)E is based on an analytically tractable stochastic model, it provides additional biological insights by quantifying biologically meaningful properties, such as the average burst size and frequency. We use D(3)E to investigate experimental data, and with the help of the underlying model, we directly test hypotheses about the driving mechanism behind changes in gene expression.

Conclusion: Evaluation using synthetic data shows that D(3)E performs better than other methods for identifying differentially expressed genes since it is designed to take full advantage of the information available from single-cell RNA-seq experiments. Moreover, the analytical model underlying D(3)E makes it possible to gain additional biological insights.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Gene Expression Profiling*
  • High-Throughput Nucleotide Sequencing / methods*
  • Humans
  • RNA / genetics*
  • Sequence Analysis, RNA / methods*
  • Single-Cell Analysis / methods*
  • Statistical Distributions

Substances

  • RNA