Training the ACRIN 6666 Investigators and effects of feedback on breast ultrasound interpretive performance and agreement in BI-RADS ultrasound feature analysis

AJR Am J Roentgenol. 2012 Jul;199(1):224-35. doi: 10.2214/AJR.11.7324.

Abstract

Objective: Qualification tasks in mammography and breast ultrasound were developed for the American College of Radiology Imaging Network (ACRIN) 6666 Investigators. We sought to assess the effects of feedback on breast ultrasound interpretive performance and agreement in BI-RADS feature analysis among a subset of these experienced observers.

Materials and methods: After a 1-hour didactic session on BI-RADS: Ultrasound, an interpretive skills quiz set of 70 orthogonal sets of breast ultrasound images including 25 (36%) malignancies was presented to 100 experienced breast imaging observers. Thirty-five observers reviewed the quiz set twice: first without and then with immediate feedback of consensus feature analysis, management recommendations, and pathologic truth. Observer performance (sensitivity, specificity, area under the curve [AUC]) was calculated without feedback and with feedback. Kappas were determined for agreement on feature analysis and assessments.

Results: For 35 observers without feedback, the mean sensitivity was 89% (range, 68-100%); specificity, 62% (range, 42-82%); and AUC, 82% (range, 73-89%). With feedback, the mean sensitivity was 93% (range, 80-100%; mean increase, 4%; range of increase, 0-12%; p < 0.0001), the mean specificity was 61% (range, 45-73%; mean decrease, 1%; range of change, -18% to 11%; p = 0.19), and the mean AUC was 84% (range, 78-90%; mean increase, 2%; range of change, -3% to 9%; p < 0.0001). Three breast imagers in the lowest quartile of initial performance showed the greatest improvement in sensitivity with no change or improvement in AUC. The kappa values for feature analysis did not change, but there was improved agreement about final assessments, with the kappa value increasing from 0.53 (SE, 0.02) without feedback to 0.59 (SE, 0.02) with feedback (p < 0.0001).

Conclusion: Most experienced breast imagers showed excellent breast ultrasound interpretive skills. Immediate feedback of consensus BI-RADS: Ultrasound features and histopathologic results improved performance in ultrasound interpretation across all experience variables.

Trial registration: ClinicalTrials.gov NCT00072501.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Area Under Curve
  • Biopsy
  • Breast Diseases / diagnostic imaging
  • Breast Neoplasms / diagnostic imaging*
  • Breast Neoplasms / pathology
  • Diagnosis, Differential
  • Feedback
  • Female
  • Follow-Up Studies
  • Humans
  • Models, Statistical
  • Phantoms, Imaging
  • Practice Patterns, Physicians' / statistics & numerical data*
  • Reproducibility of Results
  • Research Personnel / education
  • Research Personnel / statistics & numerical data*
  • Sensitivity and Specificity
  • Task Performance and Analysis
  • Ultrasonography, Mammary / methods*
  • Ultrasonography, Mammary / statistics & numerical data*
  • United States

Associated data

  • ClinicalTrials.gov/NCT00072501