A Novel Feature Selection Method for High-Dimensional Biomedical Data Based on an Improved Binary Clonal Flower Pollination Algorithm

Hum Hered. 2019;84(1):34-46. doi: 10.1159/000501652. Epub 2019 Aug 29.

Abstract

In the biomedical field, large amounts of biological and clinical data have been accumulated rapidly, which can be analyzed to emphasize the assessment of at-risk patients and improve diagnosis. However, a major challenge encountered associated with biomedical data analysis is the so-called "curse of dimensionality." For this issue, a novel feature selection method based on an improved binary clonal flower pollination algorithm is proposed to eliminate unnecessary features and ensure a highly accurate classification of disease. The absolute balance group strategy and adaptive Gaussian mutation are adopted, which can increase the diversity of the population and improve the search performance. The KNN classifier is used to evaluate the classification accuracy. Extensive experimental results in six, publicly available, high-dimensional, biomedical datasets show that the proposed method can obtain high classification accuracy and outperforms other state-of-the-art methods.

Keywords: Absolute balance group strategy; Adaptive Gaussian mutation; Clonal flower pollination algorithm; Feature selection; Microarray datasets.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Flowers / physiology
  • Humans
  • Neoplasms / classification
  • Neoplasms / genetics
  • Nervous System
  • Pollination