Unsupervised clustering in mRNA expression profiles

Comput Biol Med. 2006 Oct;36(10):1126-42. doi: 10.1016/j.compbiomed.2005.09.003. Epub 2005 Oct 24.

Abstract

The development of microarray technologies gives scientists the ability to examine, discover and monitor the mRNA transcript levels of thousands of genes in a single experiment. Nonetheless, the tremendous amount of data that can be obtained from microarray studies presents a challenge for data analysis. The most commonly used computational approach for analyzing microarray data is cluster analysis, since the number of genes is usually very high compared to the number of samples. In this paper, we investigate the application of the recently proposed k-windows clustering algorithm to gene expression microarray data. Apart from identifying the clusters present in a data set, this algorithm also calculates their number, and thus requires no special knowledge about the data. To improve the quality of the clustering, we employ various dimension reduction techniques and propose a hybrid one. The results obtained by applying the algorithm exhibit high classification success.
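The general workflow described in the abstract (reduce the gene dimension, then cluster the samples) can be illustrated with a minimal sketch. This is not the method of the paper: the k-windows algorithm and the hybrid dimension reduction technique are not specified in the abstract, so the sketch substitutes a simple variance filter for the reduction step and plain k-means for the clustering step. Unlike k-windows, k-means needs the number of clusters as input. The function names and the toy expression values are hypothetical and serve only as illustration.

```python
import random
from statistics import pvariance


def top_variance_genes(samples, n_keep):
    """Return indices of the n_keep genes with the largest variance across samples.

    Assumption: a variance filter stands in for the (unspecified)
    dimension reduction techniques mentioned in the abstract.
    """
    n_genes = len(samples[0])
    variances = [pvariance([s[g] for s in samples]) for g in range(n_genes)]
    ranked = sorted(range(n_genes), key=lambda g: variances[g], reverse=True)
    return sorted(ranked[:n_keep])


def reduce_samples(samples, gene_idx):
    """Keep only the selected gene columns of every sample."""
    return [[s[g] for g in gene_idx] for s in samples]


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, n_iter=50, seed=0):
    """Plain k-means (a stand-in for k-windows; k must be supplied here)."""
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(points, k)]
    labels = [0] * len(points)
    for _ in range(n_iter):
        # Assignment step: each sample goes to its nearest center.
        labels = [min(range(k), key=lambda c: sq_dist(p, centers[c]))
                  for p in points]
        # Update step: each center moves to the mean of its members.
        new_centers = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                new_centers.append([sum(col) / len(members)
                                    for col in zip(*members)])
            else:
                new_centers.append(centers[c])  # keep an empty cluster's center
        if new_centers == centers:
            break
        centers = new_centers
    return labels, centers


# Hypothetical toy data: 6 samples (rows) x 5 genes (columns).
# Genes 0 and 3 separate the two groups; the others are nearly constant.
samples = [
    [1.0, 5.0, 2.0, 0.9, 3.0],
    [1.2, 5.1, 2.1, 1.1, 3.1],
    [0.8, 4.9, 1.9, 1.0, 2.9],
    [9.0, 5.0, 2.0, 8.8, 3.0],
    [9.1, 5.1, 2.1, 9.2, 3.1],
    [8.9, 4.9, 1.9, 9.0, 2.9],
]

gene_idx = top_variance_genes(samples, n_keep=2)
reduced = reduce_samples(samples, gene_idx)
labels, centers = kmeans(reduced, k=2)
print("selected genes:", gene_idx)
print("cluster labels:", labels)
```

In this toy setting the variance filter discards the near-constant genes before clustering, which mirrors the motivation given in the abstract: with far more genes than samples, reducing the dimension first tends to make the cluster structure easier to recover.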

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Artificial Intelligence*
  • Cluster Analysis*
  • Colonic Neoplasms / genetics
  • Female
  • Gene Expression Profiling / methods*
  • Humans
  • Leukemia, Myeloid, Acute / genetics
  • Lymphoma / genetics
  • Male
  • Mathematical Computing*
  • Neoplasms / genetics*
  • Neural Networks, Computer
  • Oligonucleotide Array Sequence Analysis / methods*
  • Precursor Cell Lymphoblastic Leukemia-Lymphoma / genetics
  • Prostatic Neoplasms / genetics
  • RNA, Messenger / genetics*
  • Reproducibility of Results
  • Software

Substances

  • RNA, Messenger