Supervised cluster analysis for microarray data based on multivariate Gaussian mixture

Bioinformatics. 2004 Aug 12;20(12):1905-13. doi: 10.1093/bioinformatics/bth177. Epub 2004 Mar 25.

Abstract

Motivation: Grouping genes having similar expression patterns is called gene clustering, which has been proved to be a useful tool for extracting underlying biological information of gene expression data. Many clustering procedures have shown success in microarray gene clustering; most of them belong to the family of heuristic clustering algorithms. Model-based algorithms are alternative clustering algorithms, which are based on the assumption that the whole set of microarray data is a finite mixture of a certain type of distributions with different parameters. Application of the model-based algorithms to unsupervised clustering has been reported. Here, for the first time, we demonstrated the use of the model-based algorithm in supervised clustering of microarray data.

Results: We applied the proposed methods to real gene expression data and simulated data. We showed that the supervised model-based algorithm is superior over the unsupervised method and the support vector machines (SVM) method.

Availability: The program written in the SAS language implementing methods I-III in this report is available upon request. The software of SVMs is available in the website http://svm.sdsc.edu/cgi-bin/nph-SVMsubmit.cgi

Publication types

  • Comparative Study
  • Evaluation Study
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Algorithms*
  • Artificial Intelligence
  • Cell Cycle Proteins / genetics
  • Cluster Analysis*
  • Gene Expression Profiling / methods*
  • Models, Genetic*
  • Models, Statistical
  • Multivariate Analysis
  • Normal Distribution
  • Oligonucleotide Array Sequence Analysis / methods*
  • Saccharomyces cerevisiae Proteins / genetics
  • Sequence Alignment / methods*
  • Sequence Analysis, DNA / methods*
  • Software

Substances

  • Cell Cycle Proteins
  • Saccharomyces cerevisiae Proteins