Deciphering global signal features of high-throughput array data from cancers

Mol Biosyst. 2014 Jun;10(6):1549-56. doi: 10.1039/c4mb00084f. Epub 2014 Apr 3.

Abstract

Normalization of array data relies on the assumption that most genes are not altered, which means that the signals for different samples should be scaled to have similar median or average values. However, accumulating evidence suggests that gene expression could be widely up-regulated in cancers. Our previous results and subsequent findings have shown that violation of the assumption led to erroneous interpretation of microarray data. To decipher the global signal features of microarray data from cancer samples, we empirically evaluated a large collection of gene and miRNA expression profiles and copy-number variation arrays. Our results showed that, at the transcriptomic level, genes and miRNAs are widely over-expressed in a large proportion of cancers. In contrast, at the genomic level, global raw signal intensities for methylation and copy number variation show negligible differences between cancer and normal samples. These results force us to re-evaluate the proper use of normalization procedures under different experimental conditions and for different array platforms.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computational Biology / methods*
  • DNA Copy Number Variations
  • Gene Expression Profiling / methods
  • Gene Expression Regulation, Neoplastic
  • Humans
  • MicroRNAs / analysis
  • Neoplasms / genetics*
  • Oligonucleotide Array Sequence Analysis / methods*
  • RNA, Messenger / analysis

Substances

  • MicroRNAs
  • RNA, Messenger