Network constrained clustering for gene microarray data

Bioinformatics. 2005 Nov 1;21(21):4014-20. doi: 10.1093/bioinformatics/bti655. Epub 2005 Sep 1.

Abstract

Many bioinformatics problems can be tackled from a fresh angle offered by the network perspective. Directly inspired by metabolic network structural studies, we propose an improved gene clustering approach for inferring gene signaling pathways from gene microarray data. Based on the construction of co-expression networks that consists of both significantly linear and non-linear gene associations together with controlled biological and statistical significance, our approach tends to group functionally related genes into tight clusters despite their expression dissimilarities. We illustrate our approach and compare it to the traditional clustering approaches on a yeast galactose metabolism dataset and a retinal gene expression dataset. Our approach greatly outperforms the traditional approach in rediscovering the relatively well known galactose metabolism pathway in yeast and in clustering genes of the photoreceptor differentiation pathway.

Availability: The clustering method has been implemented in an R package "GeneNT" that is freely available from: http://www.cran.org.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Cluster Analysis
  • Computer Simulation
  • Gene Expression Profiling / methods*
  • Gene Expression Regulation / physiology*
  • Models, Biological*
  • Oligonucleotide Array Sequence Analysis / methods*
  • Pattern Recognition, Automated / methods
  • Proteome / metabolism*
  • Signal Transduction / physiology*
  • Transcription Factors / metabolism*
  • Transcriptional Activation / physiology

Substances

  • Proteome
  • Transcription Factors