Predicting and Classifying Breast Cancer Using Machine Learning

J Comput Biol. 2022 Jun;29(6):497-514. doi: 10.1089/cmb.2021.0236. Epub 2021 Dec 9.

Abstract

The proposed research work aims to develop a method to predict and classify breast cancer (BC) at an early stage. In this research, three models are developed, and their performance is compared against each other. The first model was built using one of the machine learning algorithms called support vector machine (SVM), the second model was built using a deep learning algorithm called convolutional neural networks (CNNs), and the third model combines CNNs with a transfer learning technique for delivering better results. The data set is provided by the BC Histopathological Image Classification (BreakHis). All models are trained on the training set with two main categories: benign tumor and malignant tumor. The malignant tumor category is divided into subsets of invasive carcinoma tumors and in situ carcinoma tumors. Furthermore, invasive carcinoma tumors are classified into grade 1, grade 2, or grade 3, where grade 3 is the highest and is more aggressive. The results show that the accuracies of biopsy image classification using SVM are 92%, the accuracy of CNN is 94%, and the accuracy of CNN using the transfer learning technique is 97%. The results of this research will be beneficial in the early diagnosis of BC and help doctors in making better decisions and medical interventions.

Keywords: breast cancer; classification; deep learning; machine learning; prediction.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Breast Neoplasms* / diagnostic imaging
  • Carcinoma*
  • Female
  • Humans
  • Machine Learning
  • Neural Networks, Computer
  • Support Vector Machine