A Gene Selection Method for Microarray Data Based on Binary PSO Encoding Gene-to-Class Sensitivity Information

IEEE/ACM Trans Comput Biol Bioinform. 2017 Jan-Feb;14(1):85-96. doi: 10.1109/TCBB.2015.2465906.

Abstract

Traditional gene selection methods for microarray data mainly considered the features' relevance by evaluating their utility for achieving accurate predication or exploiting data variance and distribution, and the selected genes were usually poorly explicable. To improve the interpretability of the selected genes as well as prediction accuracy, an improved gene selection method based on binary particle swarm optimization (BPSO) and prior information is proposed in this paper. In the proposed method, BPSO encoding gene-to-class sensitivity (GCS) information is used to perform gene selection. The gene-to-class sensitivity information, extracted from the samples by extreme learning machine (ELM), is encoded into the selection process in four aspects: initializing particles, updating the particles, modifying maximum velocity, and adopting mutation operation adaptively. Constrained by the gene-to-class sensitivity information, the new method can select functional gene subsets which are significantly sensitive to the samples' classes. With the few discriminative genes selected by the proposed method, ELM, K-nearest neighbor and support vector machine classifiers achieve much high prediction accuracy on five public microarray data, which in turn verifies the efficiency and effectiveness of the proposed gene selection method.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Biomimetics / methods*
  • Computer Simulation
  • Crowding
  • Gene Expression Profiling / methods*
  • Models, Biological*
  • Models, Statistical
  • Oligonucleotide Array Sequence Analysis / methods*
  • Reproducibility of Results
  • Sensitivity and Specificity
  • Support Vector Machine*