Parallelized evolutionary learning for detection of biclusters in gene expression data

Qinghua Huang; Dacheng Tao; Xuelong Li; Alan Wee-Chung Liew

doi:10.1109/TCBB.2011.53

Parallelized evolutionary learning for detection of biclusters in gene expression data

IEEE/ACM Trans Comput Biol Bioinform. 2012;9(2):560-70. doi: 10.1109/TCBB.2011.53. Epub 2011 Mar 3.

Authors

Qinghua Huang¹, Dacheng Tao, Xuelong Li, Alan Wee-Chung Liew

Affiliation

¹ South China University of Technology, Guangzhou.

PMID: 21383419
DOI: 10.1109/TCBB.2011.53

Abstract

The analysis of gene expression data obtained from microarray experiments is important for discovering the biological process of genes. Biclustering algorithms have been proven to be able to group the genes with similar expression patterns under a number of experimental conditions. In this paper, we propose a new biclustering algorithm based on evolutionary learning. By converting the biclustering problem into a common clustering problem, the algorithm can be applied in a search space constructed by the conditions. To further reduce the size of the search space, we randomly separate the full conditions into a number of condition subsets (subspaces), each of which has a smaller number of conditions. The algorithm is applied to each subspace and is able to discover bicluster seeds within a limited computing time. Finally, an expanding and merging procedure is employed to combine the bicluster seeds into larger biclusters according to a homogeneity criterion. We test the performance of the proposed algorithm using synthetic and real microarray data sets. Compared with several previously developed biclustering algorithms, our algorithm demonstrates a significant improvement in discovering additive biclusters.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Artificial Intelligence*
Cluster Analysis*
Colonic Neoplasms / genetics
Colonic Neoplasms / metabolism
Computational Biology / methods*
Computer Simulation
Databases, Genetic
Gene Expression Profiling / methods*
Humans
Models, Genetic*
Oligonucleotide Array Sequence Analysis
Pattern Recognition, Automated / methods
Saccharomyces cerevisiae / genetics
Saccharomyces cerevisiae / metabolism