AltWOA: Altruistic Whale Optimization Algorithm for feature selection on microarray datasets

Comput Biol Med. 2022 May:144:105349. doi: 10.1016/j.compbiomed.2022.105349. Epub 2022 Mar 10.

Abstract

The data-driven modern era has enabled the collection of large amounts of biomedical and clinical data. DNA microarray gene expression datasets in particular have gained significant attention from the research community owing to their ability to identify diseases through "bio-markers", specific alterations in the gene sequence that characterize a particular disease (for example, different types of cancer). However, gene expression datasets are very high-dimensional, and only a few of the genes are "bio-markers". Meta-heuristic-based feature selection efficiently selects only the relevant genes from a large set of attributes, reducing data storage and computation requirements. To this end, we propose an Altruistic Whale Optimization Algorithm (AltWOA) for the feature selection problem in high-dimensional microarray data. AltWOA is an improvement on the basic Whale Optimization Algorithm. We embed the concept of altruism in the whale population to help efficiently propagate candidate solutions that can reach the global optimum over the iterations. Evaluation of the proposed method on eight high-dimensional microarray datasets reveals the superiority of AltWOA over popular and classical techniques in the literature on the same datasets, both in terms of accuracy and the final number of features selected. The code for the proposed approach is publicly available at https://github.com/Rohit-Kundu/AltWOA.
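
As an illustration of how wrapper-style feature selection of this kind is commonly set up, the following minimal Python sketch scores a candidate gene subset by trading off classification error against the number of genes kept. It is not taken from the AltWOA code: the function names `binarize` and `fitness`, the sigmoid transfer function, and the weight `alpha` are all assumptions made for illustration, and the altruism mechanism itself is not modeled here.

```python
import math
import random


def binarize(position, rng):
    """Map a continuous position vector to a 0/1 feature mask.

    Assumption: a sigmoid transfer function gives the probability that
    each feature (gene) is selected. Very large negative coordinates
    would overflow math.exp, so positions are assumed to be bounded.
    """
    return [
        1 if rng.random() < 1.0 / (1.0 + math.exp(-x)) else 0
        for x in position
    ]


def fitness(mask, error_rate, alpha=0.99):
    """Score a feature mask; lower is better.

    Assumption: a weighted sum of the classifier's error rate on the
    selected features and the fraction of features kept. `error_rate`
    is supplied by the caller (e.g. from cross-validation).
    """
    if not any(mask):
        return float("inf")  # an empty subset cannot be evaluated
    selected_fraction = sum(mask) / len(mask)
    return alpha * error_rate + (1.0 - alpha) * selected_fraction
```

With `alpha` close to 1, accuracy dominates the score and the feature count mainly breaks ties between subsets of similar accuracy, which matches the two criteria (accuracy and number of selected features) on which the abstract reports results.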

Keywords: Altruism; Cancer detection; Evolutionary meta-heuristic; Feature selection; Gene expression; Microarray data.

MeSH terms

  • Algorithms
  • Altruism
  • Animals
  • Gene Expression Profiling
  • Neoplasms* / genetics
  • Oligonucleotide Array Sequence Analysis
  • Whales* / genetics