Predicting and Classifying Breast Cancer Using Machine Learning

Lina Alkhathlan; Abdul Khader Jilani Saudagar

doi:10.1089/cmb.2021.0236

J Comput Biol. 2022 Jun;29(6):497-514. doi: 10.1089/cmb.2021.0236. Epub 2021 Dec 9.

Authors

Lina Alkhathlan¹, Abdul Khader Jilani Saudagar¹

Affiliation

¹ Information Systems Department, College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh, Saudi Arabia.

PMID: 34883032
DOI: 10.1089/cmb.2021.0236

Abstract

This study aims to develop a method for predicting and classifying breast cancer (BC) at an early stage. Three models are developed and their performance is compared. The first uses a classical machine learning algorithm, the support vector machine (SVM); the second uses a deep learning algorithm, the convolutional neural network (CNN); and the third combines a CNN with transfer learning to improve results. The images come from the BC Histopathological Image Classification (BreakHis) data set. All models are trained on a training set with two main categories: benign tumor and malignant tumor. The malignant category is subdivided into invasive carcinoma and in situ carcinoma. Invasive carcinoma tumors are further classified as grade 1, grade 2, or grade 3, where grade 3 is the highest and most aggressive. The accuracy of biopsy image classification is 92% with SVM, 94% with CNN, and 97% with CNN using transfer learning. These results will be beneficial in the early diagnosis of BC and will help doctors make better decisions about medical interventions.

Keywords: breast cancer; classification; deep learning; machine learning; prediction.
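
The category hierarchy and the accuracy metric described in the abstract can be illustrated with a minimal sketch. The abstract does not say how labels were encoded or how accuracy was computed, so everything here is an assumption for illustration: the nested dictionary LABEL_TREE, the helper names leaf_paths and accuracy, and the choice of exact-match accuracy over leaf labels are all hypothetical and are not taken from the paper.

```python
from typing import Optional, Sequence

# Hypothetical label tree mirroring the categories named in the abstract.
# None marks a leaf category; a nested dict marks a further subdivision.
LABEL_TREE = {
    "benign": None,
    "malignant": {
        "in situ carcinoma": None,
        "invasive carcinoma": {
            "grade 1": None,
            "grade 2": None,
            "grade 3": None,
        },
    },
}


def leaf_paths(tree: dict, prefix: tuple = ()) -> list:
    """Return every root-to-leaf path as a tuple of category names."""
    paths = []
    for name, subtree in tree.items():
        path = prefix + (name,)
        if subtree is None:
            paths.append(path)  # leaf: a final class label
        else:
            paths.extend(leaf_paths(subtree, path))  # descend one level
    return paths


def accuracy(y_true: Sequence, y_pred: Sequence) -> float:
    """Fraction of predictions that exactly match the true labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if len(y_true) == 0:
        raise ValueError("cannot compute accuracy on an empty sequence")
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


if __name__ == "__main__":
    for path in leaf_paths(LABEL_TREE):
        print(" > ".join(path))
```

Under this assumed encoding the hierarchy flattens to five leaf classes, and a figure such as "97%" would correspond to accuracy returning 0.97 on the held-out images. Whether the authors evaluated on flattened leaf classes or level by level is not stated in the abstract.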

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Breast Neoplasms* / diagnostic imaging
Carcinoma*
Female
Humans
Machine Learning
Neural Networks, Computer
Support Vector Machine