Machine learning analysis of breast ultrasound to classify triple negative and HER2+ breast cancer subtypes

Breast Dis. 2023;42(1):59-66. doi: 10.3233/BD-220018.

Abstract

Objectives: Early diagnosis of triple-negative (TN) and human epidermal growth factor receptor 2 positive (HER2+) breast cancer is important due to its increased risk of micrometastatic spread necessitating early treatment and for guiding targeted therapies. This study aimed to evaluate the diagnostic performance of machine learning (ML) classification of newly diagnosed breast masses into TN versus non-TN (NTN) and HER2+ versus HER2 negative (HER2-) breast cancer, using radiomic features extracted from grayscale ultrasound (US) b-mode images.

Materials and methods: A retrospective chart review identified 88 female patients who underwent diagnostic breast US imaging, had confirmation of invasive malignancy on pathology and receptor status determined on immunohistochemistry available. The patients were classified as TN, NTN, HER2+ or HER2- for ground-truth labelling. For image analysis, breast masses were manually segmented by a breast radiologist. Radiomic features were extracted per image and used for predictive modelling. Supervised ML classifiers included: logistic regression, k-nearest neighbour, and Naïve Bayes. Classification performance measures were calculated on an independent (unseen) test set. The area under the receiver operating characteristic curve (AUC), sensitivity (%), and specificity (%) were reported for each classifier.

Results: The logistic regression classifier demonstrated the highest AUC: 0.824 (sensitivity: 81.8%, specificity: 74.2%) for the TN sub-group and 0.778 (sensitivity: 71.4%, specificity: 71.6%) for the HER2 sub-group.

Conclusion: ML classifiers demonstrate high diagnostic accuracy in classifying TN versus NTN and HER2+ versus HER2- breast cancers using US images. Identification of more aggressive breast cancer subtypes early in the diagnostic process could help achieve better prognoses by prioritizing clinical referral and prompting adequate early treatment.

Keywords: HER2+ breast cancer; Machine learning; triple negative breast cancer; ultrasound.

MeSH terms

  • Breast Neoplasms* / diagnostic imaging
  • Breast Neoplasms* / metabolism
  • Breast Neoplasms* / pathology
  • Female
  • Humans
  • Machine Learning*
  • Middle Aged
  • Pilot Projects
  • Receptor, ErbB-2 / metabolism
  • Retrospective Studies
  • Triple Negative Breast Neoplasms / diagnostic imaging
  • Ultrasonography*

Substances

  • ERBB2 protein, human
  • Receptor, ErbB-2