Proportion statistics to detect differentially expressed genes: a comparison with log-ratio statistics

BMC Bioinformatics. 2011 Jun 7:12:228. doi: 10.1186/1471-2105-12-228.

Abstract

Background: In genetic transcription research, gene expression is typically reported in a test sample relative to a reference sample. Laboratory assays that measure gene expression levels, from Q-RT-PCR to microarrays to RNA-Seq experiments, will compare two samples to the same genetic sequence of interest. Standard practice is to use the log(2)-ratio as the measure of relative expression. There are drawbacks to using this measurement, including unstable ratios when the denominator is small. This paper suggests an alternative estimate based on a proportion that is just as simple to calculate, just as intuitive, with the added benefit of greater numerical stability.

Results: Analysis of two groups of mice measured with 16 cDNA microarrays found similar results between the previously used methods and our proposed methods. In a study of liver and kidney samples measured with RNA-Seq, we found that proportion statistics could detect additional differentially expressed genes usually classified as missing by ratio statistics. Additionally, simulations demonstrated that one of our proposed proportion-based test statistics was robust to deviations from distributional assumptions where all other methods examined were not.

Conclusions: To measure relative expression between two samples, the proportion estimates that we propose yield equivalent results to the log(2)-ratio under most circumstances and better results than the log(2)-ratio when expression values are close to zero.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Apolipoprotein A-I / genetics
  • Cells, Cultured
  • Gene Expression Profiling / methods*
  • Humans
  • Kidney / cytology
  • Kidney / metabolism
  • Liver / cytology
  • Liver / metabolism
  • Mice
  • Oligonucleotide Array Sequence Analysis / methods*
  • Reverse Transcriptase Polymerase Chain Reaction / methods
  • Sequence Analysis, RNA

Substances

  • Apolipoprotein A-I