A study of performance on microarray data sets for a classifier based on information theoretic learning

Neural Netw. 2011 Oct;24(8):888-96. doi: 10.1016/j.neunet.2011.05.010. Epub 2011 Jun 12.

Abstract

Gene-expression microarray is a novel technology that allows the examination of tens of thousands of genes at a time. For this reason, manual observation is not feasible and machine learning methods are progressing to face these new data. Specifically, since the number of genes is very high, feature selection methods have proven valuable to deal with these unbalanced-high dimensionality and low cardinality-data sets. In this work, the FVQIT (Frontier Vector Quantization using Information Theory) classifier is employed to classify twelve DNA gene-expression microarray data sets of different kinds of cancer. A comparative study with other well-known classifiers is performed. The proposed approach shows competitive results outperforming all other classifiers.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Artificial Intelligence
  • DNA, Neoplasm / genetics
  • Databases, Genetic*
  • Entropy
  • Fuzzy Logic
  • Humans
  • Information Theory*
  • Microarray Analysis / classification*
  • Microarray Analysis / methods
  • Models, Genetic
  • Models, Statistical
  • Reproducibility of Results
  • Software

Substances

  • DNA, Neoplasm