Multiparametric MRI of the prostate: diagnostic performance and interreader agreement of two scoring systems

Br J Radiol. 2016 Jun;89(1062):20151056. doi: 10.1259/bjr.20151056. Epub 2016 Mar 23.

Abstract

Objective: To compare the diagnostic accuracies and interreader agreements of the Prostate Imaging Reporting and Data System (PI-RADS) v. 2 and University of California San Francisco (UCSF) multiparametric prostate MRI scale for diagnosing clinically significant prostate cancer.

Methods: This institutional review board-approved retrospective study included 49 males who had 1.5 T endorectal MRI and prostatectomy. Two radiologists scored suspicious lesions on MRI using PI-RADS v. 2 and the UCSF scale. Percent agreement, 2 × 2 tables and the area under the receiver operating characteristic curves (Az) were used to assess and compare the individual and overall scores of these scales. Interreader agreements were estimated with kappa statistics.

Results: Reader 1 (R1) detected 78 lesions, and Reader 2 (R2) detected 80 lesions. Both identified 52 of 65 significant cancers. The Az for PI-RADS v. 2 and UCSF scale for R1 were 0.68 and 0.69 [T2 weighted imaging (T2WI)], 0.75 and 0.68 [diffusion-weighted imaging (DWI)] and 0.64 and 0.72 (overall score), respectively, and were 0.72 and 0.75 (T2WI), 0.73 and 0.67 (DWI) and 0.66 and 0.75 (overall score) for R2. The dynamic contrast-enhanced percent agreements between scales were 100% (R1) and 95% (R2). PI-RADS v. 2 DWI of R1 performed better than UCSF DWI (Az = 0.75 vs Az = 0.68; p = 0.05); no other differences were found. The interreader agreements were higher for PI-RADS v. 2 (T2WI: 0.56 vs 0.42; DWI: 0.60 vs 0.46; overall: 0.61 vs 0.42). The UCSF approach to derive the overall PI-RADS v. 2 scores increased the Az for the identification of significant cancer (R1 to 0.76, p < 0.05; R2 to 0.71, p = 0.35).

Conclusion: Although PI-RADS v. 2 DWI score may have a higher discriminatory performance than the UCSF scale counterpart to diagnose clinically significant cancer, the utilization of the UCSF scale weighing system for the integration of PI-RADS v. 2 individual parameter scores improved the accuracy its overall score.

Advances in knowledge: PI-RADS v. 2 is moderately accurate for the identification of clinically significant prostate cancer, but the utilization of alternative approaches to derive the overall PI-RADS v. 2 score, including the one used by the UCSF system, may improve its diagnostic accuracy.

Publication types

  • Comparative Study
  • Evaluation Study

MeSH terms

  • Aged
  • Algorithms
  • Humans
  • Image Enhancement / methods
  • Image Enhancement / standards*
  • Image Interpretation, Computer-Assisted / methods
  • Image Interpretation, Computer-Assisted / standards*
  • Internationality
  • Magnetic Resonance Imaging / methods
  • Magnetic Resonance Imaging / standards*
  • Male
  • Middle Aged
  • Multimodal Imaging / methods
  • Multimodal Imaging / standards*
  • Observer Variation
  • Practice Guidelines as Topic
  • Prostate / diagnostic imaging*
  • Prostatic Neoplasms / diagnostic imaging*
  • Reproducibility of Results
  • Sensitivity and Specificity