Ranking analysis of microarray data: a powerful method for identifying differentially expressed genes

Genomics. 2006 Dec;88(6):846-854. doi: 10.1016/j.ygeno.2006.08.003. Epub 2006 Sep 18.

Abstract

Microarray technology provides a powerful tool for the expression profile of thousands of genes simultaneously, which makes it possible to explore the molecular and metabolic etiology of the development of a complex disease under study. However, classical statistical methods and technologies fail to be applicable to microarray data. Therefore, it is necessary and motivating to develop powerful methods for large-scale statistical analyses. In this paper, we described a novel method, called Ranking Analysis of Microarray Data (RAM). RAM, which is a large-scale two-sample t-test method, is based on comparisons between a set of ranked T statistics and a set of ranked Z values (a set of ranked estimated null scores) yielded by a "randomly splitting" approach instead of a "permutation" approach and a two-simulation strategy for estimating the proportion of genes identified by chance, i.e., the false discovery rate (FDR). The results obtained from the simulated and observed microarray data show that RAM is more efficient in identification of genes differentially expressed and estimation of FDR under undesirable conditions such as a large fudge factor, small sample size, or mixture distribution of noises than Significance Analysis of Microarrays.

Publication types

  • Evaluation Study

MeSH terms

  • Animals
  • Computational Biology / methods*
  • False Positive Reactions
  • Gene Expression Profiling*
  • Models, Statistical
  • Oligonucleotide Array Sequence Analysis / methods*
  • Proteins / genetics
  • Proteins / metabolism*
  • Random Allocation
  • Rats
  • Sample Size
  • Stroke*

Substances

  • Proteins