Gene-set analysis and reduction

Brief Bioinform. 2009 Jan;10(1):24-34. doi: 10.1093/bib/bbn042. Epub 2008 Oct 4.

Abstract

Gene-set analysis aims to identify differentially expressed gene sets (pathways) by a phenotype in DNA microarray studies. We review here important methodological aspects of gene-set analysis and illustrate them with varying performance of several methods proposed in the literature. We emphasize the importance of distinguishing between 'self-contained' versus 'competitive' methods, following Goeman and Bühlmann. We also discuss reducing a gene set to its subset, consisting of 'core members' that chiefly contribute to the statistical significance of the differential expression of the initial gene set by phenotype. Significance analysis of microarray for gene-set reduction (SAM-GSR) can be used for an analytical reduction of gene sets to their core subsets. We apply SAM-GSR on a microarray dataset for identifying biological gene sets (pathways) whose gene expressions are associated with p53 mutation in cancer cell lines. Codes to implement SAM-GSR in the statistical package R can be downloaded from http://www.ualberta.ca/~yyasui/homepage.html.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Algorithms*
  • Cell Line, Tumor
  • Gene Expression Profiling / methods*
  • Humans
  • Models, Genetic
  • Models, Statistical
  • Oligonucleotide Array Sequence Analysis / methods*
  • Phenotype
  • Tumor Suppressor Protein p53 / genetics

Substances

  • Tumor Suppressor Protein p53