ArrayCluster: an analytic tool for clustering, data visualization and module finder on gene expression profiles

Bioinformatics. 2006 Jun 15;22(12):1538-9. doi: 10.1093/bioinformatics/btl129. Epub 2006 Apr 10.

Abstract

One of the significant challenges in gene expression analysis is to find unknown subtypes of several diseases at the molecular levels. This task can be addressed by grouping gene expression patterns of the collected samples on the basis of a large number of genes. Application of commonly used clustering methods to such a dataset however are likely to fail owing to over-learning, because the number of samples to be grouped is much smaller than the data dimension which is equal to the number of genes involved in the dataset. To overcome such difficulty, we developed a novel model-based clustering method, referred to as the mixed factors analysis. The ArrayCluster is a freely available software to perform the mixed factors analysis. It provides us some analytic tools for clustering DNA microarray experiments, data visualization and an automatic detector for module transcriptional of genes that are relevant to the calibrated molecular subtypes and so on.

MeSH terms

  • Bayes Theorem
  • Cluster Analysis*
  • Computational Biology / methods*
  • Gene Expression Profiling / methods*
  • Models, Statistical
  • Normal Distribution
  • Oligonucleotide Array Sequence Analysis / methods*
  • Programming Languages
  • Software