Impact of DNA microarray data transformation on gene expression analysis - comparison of two normalization methods

Acta Biochim Pol. 2011;58(4):573-80. Epub 2011 Dec 20.

Abstract

Two-color DNA microarrays are commonly used for the analysis of global gene expression. They provide information on relative abundance of thousands of mRNAs. However, the generated data need to be normalized to minimize systematic variations so that biologically significant differences can be more easily identified. A large number of normalization procedures have been proposed and many softwares for microarray data analysis are available. Here, we have applied two normalization methods (median and loess) from two packages of microarray data analysis softwares. They were examined using a sample data set. We found that the number of genes identified as differentially expressed varied significantly depending on the method applied. The obtained results, i.e. lists of differentially expressed genes, were consistent only when we used median normalization methods. Loess normalization implemented in the two software packages provided less coherent and for some probes even contradictory results. In general, our results provide an additional piece of evidence that the normalization method can profoundly influence final results of DNA microarray-based analysis. The impact of the normalization method depends greatly on the algorithm employed. Consequently, the normalization procedure must be carefully considered and optimized for each individual data set.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Bacterial Adhesion
  • Caco-2 Cells
  • Data Interpretation, Statistical
  • Gene Expression Profiling / methods*
  • Gene Expression Profiling / standards
  • Gene Expression Regulation
  • Humans
  • Lactobacillus / growth & development
  • Lactobacillus / metabolism
  • Oligonucleotide Array Sequence Analysis / methods*
  • Oligonucleotide Array Sequence Analysis / standards
  • Probiotics / administration & dosage
  • Reproducibility of Results
  • Sensitivity and Specificity
  • Software*
  • Transcriptome