A Novel Bioinspired Algorithm for Mixed and Incomplete Breast Cancer Data Classification

Int J Environ Res Public Health. 2023 Feb 13;20(4):3240. doi: 10.3390/ijerph20043240.

Abstract

The pre-diagnosis of cancer has been approached from various perspectives, so it is imperative to continue improving classification algorithms to achieve early diagnosis of the disease and improve patient survival. In the medical field, there are data that, for various reasons, are lost. There are also datasets that mix numerical and categorical values. Very few algorithms classify datasets with such characteristics. Therefore, this study proposes the modification of an existing algorithm for the classification of cancer. The said algorithm showed excellent results compared with classical classification algorithms. The AISAC-MMD (Mixed and Missing Data) is based on the AISAC and was modified to work with datasets with missing and mixed values. It showed significantly better performance than bio-inspired or classical classification algorithms. Statistical analysis established that the AISAC-MMD significantly outperformed the Nearest Neighbor, C4.5, Naïve Bayes, ALVOT, Naïve Associative Classifier, AIRS1, Immunos1, and CLONALG algorithms in conducting breast cancer classification.

Keywords: artificial intelligence; bio-inspired algorithms; breast cancer; machine learning.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Bayes Theorem
  • Breast Neoplasms*
  • Cluster Analysis
  • Female
  • Humans
  • Support Vector Machine

Grants and funding

This research received no external funding.