A computer-aided diagnosis system for breast ultrasound based on weighted BI-RADS classes

Comput Methods Programs Biomed. 2018 Jan:153:33-40. doi: 10.1016/j.cmpb.2017.10.004. Epub 2017 Oct 3.

Abstract

Background and objective: Conventional computer-aided diagnosis (CAD) systems for breast ultrasound (BUS) are trained to classify pathological classes, that is, benign and malignant. However, from a clinical perspective, this kind of classification does not agree totally with radiologists' diagnoses. Usually, the tumors are assessed by using a BI-RADS (Breast Imaging-Reporting and Data System) category and, accordingly, a recommendation is emitted: annual study for category 2 (benign), six-month follow-up study for category 3 (probably benign), and biopsy for categories 4 and 5 (suspicious of malignancy). Hence, in this paper, a CAD system based on BI-RADS categories weighted by pathological information is presented. The goal is to increase the classification performance by reducing the common class imbalance found in pathological classes as well as to provide outcomes quite similar to radiologists' recommendations.

Methods: The BUS dataset considers 781 benign lesions and 347 malignant tumors proven by biopsy. Moreover, every lesion is associated to one BI-RADS category in the set {2, 3, 4, 5}. Thus, the dataset is split into three weighted classes: benign, BI-RADS 2 in benign lesions; probably benign, BI-RADS 3 and 4 in benign lesions; and malignant, BI-RADS 4 and 5 in malignant lesions. Thereafter, a random forest (RF) classifier, denoted by RFw, is trained to predict the weighted BI-RADS classes. In addition, for comparison purposes, a RF classifier is trained to predict pathological classes, denoted as RFp.

Results: The ability of the classifiers to predict the pathological classes is measured by the area under the ROC curve (AUC), sensitivity (SEN), and specificity (SPE). The RFw classifier obtained AUC=0.872,SEN=0.826, and SPE=0.919, whereas the RFp classifier reached AUC=0.868,SEN=0.808, and SPE=0.929. According to a one-way analysis of variance test, the RFw classifier statistically outperforms (p < 0.001) the RFp classifier in terms of the AUC and SEN. Moreover, the classification performance of RFw to predict weighted BI-RADS classes is given by the Matthews correlation coefficient that obtained 0.614.

Conclusions: The division of the classification problem into three classes reduces the imbalance between benign and malignant classes; thus, the sensitivity is increased without degrading the specificity. Therefore, the CAD based on weighted BI-RADS classes improves the classification performance of the conventional CAD systems. Additionally, the proposed approach has the advantage of being capable of providing a multiclass outcome related to radiologists' recommendations.

Keywords: BI-RADS; Breast cancer; Computer-aided diagnosis; Ultrasound.

MeSH terms

  • Adolescent
  • Adult
  • Aged
  • Aged, 80 and over
  • Breast / diagnostic imaging*
  • Breast / pathology
  • Breast Diseases / diagnostic imaging*
  • Breast Diseases / pathology
  • Diagnosis, Computer-Assisted*
  • Female
  • Humans
  • Middle Aged
  • Young Adult