Machine Learning Approaches with Textural Features to Calculate Breast Density on Mammography

Curr Oncol. 2023 Jan 7;30(1):839-853. doi: 10.3390/curroncol30010064.

Abstract

Background: breast cancer (BC) is the world's most prevalent cancer in the female population, with 2.3 million new cases diagnosed worldwide in 2020. The great efforts made to set screening campaigns, early detection programs, and increasingly targeted treatments led to significant improvement in patients' survival. The Full-Field Digital Mammograph (FFDM) is considered the gold standard method for the early diagnosis of BC. From several previous studies, it has emerged that breast density (BD) is a risk factor in the development of BC, affecting the periodicity of screening plans present today at an international level.

Objective: in this study, the focus is the development of mammographic image processing techniques that allow the extraction of indicators derived from textural patterns of the mammary parenchyma indicative of BD risk factors.

Methods: a total of 168 patients were enrolled in the internal training and test set while a total of 51 patients were enrolled to compose the external validation cohort. Different Machine Learning (ML) techniques have been employed to classify breasts based on the values of the tissue density. Textural features were extracted only from breast parenchyma with which to train classifiers, thanks to the aid of ML algorithms.

Results: the accuracy of different tested classifiers varied between 74.15% and 93.55%. The best results were reached by a Support Vector Machine (accuracy of 93.55% and a percentage of true positives and negatives equal to TPP = 94.44% and TNP = 92.31%). The best accuracy was not influenced by the choice of the features selection approach. Considering the external validation cohort, the SVM, as the best classifier with the 7 features selected by a wrapper method, showed an accuracy of 0.95, a sensitivity of 0.96, and a specificity of 0.90.

Conclusions: our preliminary results showed that the Radiomics analysis and ML approach allow us to objectively identify BD.

Keywords: breast cancer; breast density; machine learning; mammography; radiomics.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Breast / diagnostic imaging
  • Breast Density*
  • Breast Neoplasms* / diagnostic imaging
  • Female
  • Humans
  • Machine Learning
  • Mammography / methods

Grants and funding

This study has been partially funded by IMS GIOTTO S.p.A. Sasso Marconi (BO), Italy. Special thanks to Sara Vecchio at IMS for useful discussions.