Gene Selection via a New Hybrid Ant Colony Optimization Algorithm for Cancer Classification in High-Dimensional Data

Comput Math Methods Med. 2019 Oct 13:2019:7828590. doi: 10.1155/2019/7828590. eCollection 2019.

Abstract

The recent advance in the microarray data analysis makes it easy to simultaneously measure the expression levels of several thousand genes. These levels can be used to distinguish cancerous tissues from normal ones. In this work, we are interested in gene expression data dimension reduction for cancer classification, which is a common task in most microarray data analysis studies. This reduction has an essential role in enhancing the accuracy of the classification task and helping biologists accurately predict cancer in the body; this is carried out by selecting a small subset of relevant genes and eliminating the redundant or noisy genes. In this context, we propose a hybrid approach (MWIS-ACO-LS) for the gene selection problem, based on the combination of a new graph-based approach for gene selection (MWIS), in which we seek to minimize the redundancy between genes by considering the correlation between the latter and maximize gene-ranking (Fisher) scores, and a modified ACO coupled with a local search (LS) algorithm using the classifier 1NN for measuring the quality of the candidate subsets. In order to evaluate the proposed method, we tested MWIS-ACO-LS on ten well-replicated microarray datasets of high dimensions varying from 2308 to 12600 genes. The experimental results based on ten high-dimensional microarray classification problems demonstrated the effectiveness of our proposed method.

MeSH terms

  • Algorithms*
  • Artificial Intelligence
  • Computational Biology
  • Gene Expression Profiling
  • Gene Expression Regulation, Neoplastic
  • Genes, Neoplasm*
  • Humans
  • Neoplasms / genetics*
  • Oligonucleotide Array Sequence Analysis
  • Pattern Recognition, Automated
  • Reproducibility of Results
  • Software
  • Support Vector Machine