Finding dominant sets in microarray data

Xuping Fu; Li Teng; Yao Li; Wenbin Chen; Yumin Mao; I-Fan Shen; Yi Xie

doi:10.2741/1763

Finding dominant sets in microarray data

Front Biosci. 2005 Sep 1:10:3068-77. doi: 10.2741/1763.

Authors

Xuping Fu¹, Li Teng, Yao Li, Wenbin Chen, Yumin Mao, I-Fan Shen, Yi Xie

Affiliation

¹ State Key Laboratory of Genetic Engineering, Institute of Genetics, School of Life Science, Fudan University, Shanghai 200433, PR China.

PMID: 15970561
DOI: 10.2741/1763

Abstract

Clustering allows us to extract groups of genes that are tightly coexpressed from Microarray data. In this paper, a new method DSF_Clust is developed to find dominant sets (clusters). We have preformed DSF_Clust on several gene expression datasets and given the evaluation with some criteria. The results showed that this approach could cluster dominant sets of good quality compared to kmeans method. DSF_Clust deals with three issues that have bedeviled clustering, some dominant sets being statistically determined in a significance level, predefining cluster structure being not required, and the quality of a dominant set being ensured. We have also applied this approach to analyze published data of yeast cell cycle gene expression and found some biologically meaningful gene groups to be dug out. Furthermore, DSF_Clust is a potentially good tool to search for putative regulatory signals.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms*
Cluster Analysis*
Computational Biology
Gene Expression Profiling*
Gene Expression Regulation
Oligonucleotide Array Sequence Analysis
Saccharomyces cerevisiae / genetics
Saccharomyces cerevisiae / metabolism*