The painter's feature selection for gene expression data

Annu Int Conf IEEE Eng Med Biol Soc. 2007:2007:4227-30. doi: 10.1109/IEMBS.2007.4353269.

Abstract

Feature selection is a fundamental task in microarray data analysis. It aims to identify the genes most strongly associated with a tissue category, disease state, or clinical outcome. Effective feature selection reduces computational cost and increases classification accuracy. This paper presents a novel multi-class approach to feature selection for gene expression data, called the Painter's approach. It combines the benefits of a parameter-free technique with those of a natively multicategory method. It consists of two phases. The first is a filtering phase that smooths the effect of noise and outliers, a common problem in microarray data. The second phase performs the actual gene selection. Preliminary experimental results on three public datasets are presented. They support the intuition behind the proposed approach, which achieves high classification accuracy.
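The abstract does not specify how either phase is computed, so the following is only a generic sketch of a two-phase "filter, then select" pipeline, not the Painter's approach itself. Everything in it is an assumption made for illustration: the outlier filter (clipping to median ± k·MAD within each class), the overlap-based score, and the names `smooth_outliers`, `overlap_score`, and `select_genes`. Note also that the clipping width `k` is a tunable parameter, whereas the paper describes its method as parameter-free.

```python
from statistics import median


def smooth_outliers(values, k=3.0):
    """Phase 1 stand-in: clip values to median +/- k * MAD.

    Assumed filter for illustration only; the paper's filter is not
    described in the abstract.
    """
    med = median(values)
    mad = median(abs(v - med) for v in values)
    lo, hi = med - k * mad, med + k * mad
    return [min(max(v, lo), hi) for v in values]


def overlap_score(expr, labels):
    """Phase 2 stand-in: fraction of a gene's range where classes overlap.

    Works for any number of classes by summing pairwise overlaps of the
    per-class [min, max] intervals. Lower means better class separation.
    """
    by_class = {}
    for value, label in zip(expr, labels):
        by_class.setdefault(label, []).append(value)

    # Per-class expression interval after outlier smoothing.
    ranges = []
    for vals in by_class.values():
        smoothed = smooth_outliers(vals)
        ranges.append((min(smoothed), max(smoothed)))

    total = max(hi for _, hi in ranges) - min(lo for lo, _ in ranges)
    if total == 0:
        return float("inf")  # constant gene carries no class information

    overlap = 0.0
    for i in range(len(ranges)):
        for j in range(i + 1, len(ranges)):
            lo = max(ranges[i][0], ranges[j][0])
            hi = min(ranges[i][1], ranges[j][1])
            overlap += max(0.0, hi - lo)
    return overlap / total


def select_genes(matrix, labels, n):
    """Return the n gene names with the lowest overlap score."""
    scores = {gene: overlap_score(expr, labels) for gene, expr in matrix.items()}
    return sorted(scores, key=scores.get)[:n]


# Toy data (made up): g1 separates classes A and B, g2 does not.
labels = ["A", "A", "A", "B", "B", "B"]
matrix = {
    "g1": [1.0, 1.1, 0.9, 5.0, 5.2, 4.8],
    "g2": [1.0, 3.0, 2.0, 1.5, 2.5, 2.0],
}
top = select_genes(matrix, labels, 1)
```

On this toy input the class intervals of `g1` do not overlap while those of `g2` do, so `g1` is ranked first. A real evaluation would then train a classifier on the selected genes, as the paper reports doing on three public datasets.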

MeSH terms

  • Animals
  • Gene Expression Profiling / methods*
  • Humans
  • Models, Theoretical*
  • Oligonucleotide Array Sequence Analysis / methods*
  • Predictive Value of Tests
  • Software*