An ensemble feature selection technique for cancer recognition

Biomed Mater Eng. 2014;24(1):1001-8. doi: 10.3233/BME-130897.

Abstract

Correlation-based feature selection (CFS) using neighborhood mutual information (NMI) and particle swarm optimization (PSO) are combined into an ensemble technique in this paper. Based on this observation, an efficient gene selection algorithm, denoted by NMICFS-PSO, is proposed. Several cancer recognition tasks are gathered for testing the proposed technique. Moreover, support vector machine (SVM), integrated with leave-one-out cross-validation and served as a classifier, is employed for six classification profiles to calculate the classification accuracy. Experimental results show that the proposed method can reduce the redundant features effectively and achieve superior performance. The classification accuracy obtained by our method is higher in five out of the six gene expression problems as compared with that of other classifi cation methods.

Keywords: Feature selection; neighborhood mutual information; particle swarm optimization; support vector machine.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Breast Neoplasms / diagnosis
  • Breast Neoplasms / genetics*
  • Female
  • Gene Expression Profiling*
  • Humans
  • Leukemia / diagnosis
  • Leukemia / genetics*
  • Reproducibility of Results
  • Software
  • Stochastic Processes
  • Support Vector Machine