Regularized linear discriminant analysis and its application in microarrays

Biostatistics. 2007 Jan;8(1):86-100. doi: 10.1093/biostatistics/kxj035. Epub 2006 Apr 7.

Abstract

In this paper, we introduce a modified version of linear discriminant analysis, called the "shrunken centroids regularized discriminant analysis" (SCRDA). This method generalizes the idea of the "nearest shrunken centroids" (NSC) (Tibshirani and others, 2003) into the classical discriminant analysis. The SCRDA method is specially designed for classification problems in high dimension low sample size situations, for example, microarray data. Through both simulated data and real life data, it is shown that this method performs very well in multivariate classification problems, often outperforms the PAM method (using the NSC algorithm) and can be as competitive as the support vector machines classifiers. It is also suitable for feature elimination purpose and can be used as gene selection method. The open source R package for this method (named "rda") is available on CRAN (http://www.r-project.org) for download and testing.

Publication types

  • Comparative Study

MeSH terms

  • Computer Simulation
  • DNA, Neoplasm / genetics
  • Discriminant Analysis*
  • Gene Expression Profiling / methods
  • Humans
  • Linear Models*
  • Neoplasms / classification
  • Neoplasms / genetics
  • Oligonucleotide Array Sequence Analysis / methods*

Substances

  • DNA, Neoplasm