Novel chaotic oppositional fruit fly optimization algorithm for feature selection applied on COVID 19 patients' health prediction

PLoS One. 2022 Oct 10;17(10):e0275727. doi: 10.1371/journal.pone.0275727. eCollection 2022.

Abstract

The fast-growing quantity of information hinders the process of machine learning, making it computationally costly and with substandard results. Feature selection is a pre-processing method for obtaining the optimal subset of features in a data set. Optimization algorithms struggle to decrease the dimensionality while retaining accuracy in high-dimensional data set. This article proposes a novel chaotic opposition fruit fly optimization algorithm, an improved variation of the original fruit fly algorithm, advanced and adapted for binary optimization problems. The proposed algorithm is tested on ten unconstrained benchmark functions and evaluated on twenty-one standard datasets taken from the Univesity of California, Irvine repository and Arizona State University. Further, the presented algorithm is assessed on a coronavirus disease dataset, as well. The proposed method is then compared with several well-known feature selection algorithms on the same datasets. The results prove that the presented algorithm predominantly outperform other algorithms in selecting the most relevant features by decreasing the number of utilized features and improving classification accuracy.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Arizona
  • COVID-19*
  • Drosophila
  • Machine Learning

Associated data

  • figshare/10.6084/m9.figshare.20538849

Grants and funding

This project is funded by King Saud University, Riyadh, Saudi Arabia. Research Supporting Project number (RSP-2021/323), King Saud University, Riyadh, Saudi Arabia.