Machine Learning Based Comparative Analysis for Breast Cancer Prediction

J Healthc Eng. 2022 Apr 11:2022:4365855. doi: 10.1155/2022/4365855. eCollection 2022.

Abstract

One of the most prevalent and leading causes of cancer in women is breast cancer. It has now become a frequent health problem, and its prevalence has recently increased. The easiest approach to dealing with breast cancer findings is to recognize them early on. Early detection of breast cancer is facilitated by computer-aided detection and diagnosis (CAD) technologies, which can help people live longer lives. The major goal of this work is to take advantage of recent developments in CAD systems and related methodologies. In 2011, the United States reported that one out of every eight women was diagnosed with cancer. Breast cancer originates as a result of aberrant cell division in the breast, which leads to either benign or malignant cancer formation. As a result, early detection of breast cancer is critical, and with effective treatment, many lives can be saved. This research covers the findings and analyses of multiple machine learning models for identifying breast cancer. The Wisconsin Breast Cancer Diagnostic (WBCD) dataset was used to develop the method. Despite its small size, the dataset provides some interesting data. The information was analyzed and put to use in a number of machine learning models. For prediction, random forest, logistic regression, decision tree, and K-nearest neighbor were utilized. When the results are compared, the logistic regression model is found to offer the best results. Logistic regression achieves 98% accuracy, which is better than the previous method reported.

Publication types

  • Research Support, Non-U.S. Gov't
  • Retracted Publication

MeSH terms

  • Breast
  • Breast Neoplasms* / diagnosis
  • Cluster Analysis
  • Female
  • Humans
  • Logistic Models
  • Machine Learning