Observer agreement using the ACR Breast Imaging Reporting and Data System (BI-RADS)-ultrasound, First Edition (2003)

Korean J Radiol. 2007 Sep-Oct;8(5):397-402. doi: 10.3348/kjr.2007.8.5.397.

Abstract

Objective: This study aims to evaluate the degree of inter- and intraobserver agreement when characterizing breast abnormalities using the Breast Imaging Reporting and Data System (BI-RADS)-ultrasound (US) lexicon, as defined by the American College of Radiology (ACR).

Materials and methods: Two hundred ninety three female patients with 314 lesions underwent US-guided biopsies at one facility during a two-year period. Static sonographic images of each breast lesion were acquired and reviewed by four radiologists with expertise in breast imaging. Each radiologist independently evaluated all cases and described the mass according to BI-RADS-US. To assess intraobserver variability, one of the four radiologists reassessed all of the cases one month after the initial evaluation. Inter- and intraobserver variabilities were determined using Cohen's kappa (k) statistics.

Results: The greatest degree of reliability for a descriptor was found for mass orientation (k = 0.61) and the least concordance of fair was found for the mass margin (k = 0.32) and echo pattern (k = 0.36). Others descriptive terms: shape, lesion boundary and posterior features (k = 0.42, k = 0.55 and k = 0.53, respectively) and the final assessment (k = 0.51) demonstrated only moderate levels of agreement. A substantial degree of intraobserver agreement was found when classifying all morphologic features: shape, orientation, margin, lesion boundary, echo pattern and posterior feature (k = 0.73, k = 0.68, k = 0.64, 0.68, k = 0.65 and k = 0.64, respectively) and rendering final assessments (k = 0.65).

Conclusion: Although BI-RADS-US was created to achieve a consensus among radiologists when describing breast abnormalities, our study shows substantial intraobserver agreement but only moderate interobserver agreement in the mass description and final assessment of breast abnormalities according to its use. A better agreement will ultimately require specialized education, as well as self-auditing practice tests.

Publication types

  • Evaluation Study

MeSH terms

  • Adenocarcinoma / classification
  • Adenocarcinoma / diagnosis*
  • Adenocarcinoma, Mucinous / classification
  • Adenocarcinoma, Mucinous / diagnosis*
  • Adolescent
  • Adult
  • Aged
  • Aged, 80 and over
  • Biopsy
  • Breast Neoplasms / classification
  • Breast Neoplasms / diagnosis*
  • Carcinoma, Ductal, Breast / classification
  • Carcinoma, Ductal, Breast / diagnosis*
  • Carcinoma, Intraductal, Noninfiltrating / classification
  • Carcinoma, Intraductal, Noninfiltrating / diagnosis*
  • Female
  • Follow-Up Studies
  • Humans
  • Middle Aged
  • Observer Variation
  • Predictive Value of Tests
  • Radiology
  • Reproducibility of Results
  • Sensitivity and Specificity
  • Societies, Medical
  • Terminology as Topic
  • Ultrasonography, Doppler, Color / statistics & numerical data
  • Ultrasonography, Mammary / statistics & numerical data*