Ranking analysis of microarray data: a powerful method for identifying differentially expressed genes

Yuan-De Tan; Myriam Fornage; Yun-Xin Fu

doi:10.1016/j.ygeno.2006.08.003

Ranking analysis of microarray data: a powerful method for identifying differentially expressed genes

Genomics. 2006 Dec;88(6):846-854. doi: 10.1016/j.ygeno.2006.08.003. Epub 2006 Sep 18.

Authors

Yuan-De Tan¹, Myriam Fornage¹, Yun-Xin Fu²

Affiliations

¹ Institute of Molecular Medicine, School of Public Health, University of Texas at Houston, Houston, TX 77030, USA.
² Laboratory for Conservation and Utilization of Bioresources, Yunnan University, Kunming, Yunnan 650, China; Human Genetics Center, School of Public Health, University of Texas at Houston, Houston, TX 77030, USA. Electronic address: Yunxin.fu@uth.tmc.edu.

Abstract

Microarray technology provides a powerful tool for the expression profile of thousands of genes simultaneously, which makes it possible to explore the molecular and metabolic etiology of the development of a complex disease under study. However, classical statistical methods and technologies fail to be applicable to microarray data. Therefore, it is necessary and motivating to develop powerful methods for large-scale statistical analyses. In this paper, we described a novel method, called Ranking Analysis of Microarray Data (RAM). RAM, which is a large-scale two-sample t-test method, is based on comparisons between a set of ranked T statistics and a set of ranked Z values (a set of ranked estimated null scores) yielded by a "randomly splitting" approach instead of a "permutation" approach and a two-simulation strategy for estimating the proportion of genes identified by chance, i.e., the false discovery rate (FDR). The results obtained from the simulated and observed microarray data show that RAM is more efficient in identification of genes differentially expressed and estimation of FDR under undesirable conditions such as a large fudge factor, small sample size, or mixture distribution of noises than Significance Analysis of Microarrays.

Publication types

Evaluation Study

MeSH terms

Animals
Computational Biology / methods*
False Positive Reactions
Gene Expression Profiling*
Models, Statistical
Oligonucleotide Array Sequence Analysis / methods*
Proteins / genetics
Proteins / metabolism*
Random Allocation
Rats
Sample Size
Stroke*

Substances

Proteins

Abstract

Publication types

MeSH terms

Substances

Grants and funding