Mapping physiological states from microarray expression measurements

Bioinformatics. 2002 Aug;18(8):1054-63. doi: 10.1093/bioinformatics/18.8.1054.

Abstract

Motivation: The increasing use of DNA microarrays to probe cell physiology requires methods for visualizing different expression phenotypes and explicitly connecting individual genes to discriminating expression features. Such methods should be robust and maintain biological interpretability.

Results: We propose a method for the mapping of the physiological state of cells and tissues from multidimensional expression data such as those obtained with DNA microarrays. The method uses Fisher discriminant analysis to create a linear projection of gene expression measurements that maximizes the separation of different sample classes. Relative to other typical classification methods, this method provides insights into the discriminating characteristics of expression measurements in terms of the contribution of individual genes to the definition of distinct physiological states. This projection method also facilitates visualization of classification results in a reduced dimensional space. Examples from four different cases demonstrate the ability of the method to produce well-separated groups in the projection space and to identify important genes for defining physiological states. The method can be augmented to also include data from the proteomic and metabolic phenotypes and can be useful in disease diagnosis, drug screening and bioprocessing applications.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Algorithms
  • Cluster Analysis
  • Cyanobacteria / genetics
  • Cyanobacteria / physiology
  • DNA / analysis
  • DNA / classification*
  • DNA / genetics
  • DNA / physiology*
  • Databases, Genetic
  • Discriminant Analysis
  • Gene Expression / genetics
  • Gene Expression / physiology*
  • Gene Expression Profiling / methods*
  • Gene Expression Regulation
  • Models, Biological*
  • Models, Statistical
  • Neoplasms, Glandular and Epithelial / genetics
  • Neoplasms, Glandular and Epithelial / physiopathology
  • Oligonucleotide Array Sequence Analysis / methods*
  • Quality Control
  • Reproducibility of Results
  • Sensitivity and Specificity

Substances

  • DNA