Comparison of visual criteria for amyloid-PET reading: could criteria merging reduce inter-rater variability?

Q J Nucl Med Mol Imaging. 2020 Dec;64(4):414-421. doi: 10.23736/S1824-4785.19.03124-8. Epub 2019 May 8.

Abstract

Background: Three different amyloid tracers labeled with fluorine-18 have been introduced into clinical use. The package leaflets of these tracers indicate different visual criteria for PET reporting. In clinical practice, it has not yet been ascertained whether these criteria are equivalent in terms of diagnostic accuracy or whether any one is better than another. We aimed to evaluate the inter- and intra-rater variability of visual assessment of 18F-florbetapir PET/CT images among six independent readers with different clinical experience.

Methods: We analyzed 252 PET/CT scans, each visually assessed three times by each reader, independently applying each of the three proposed reading criteria. Each reader evaluated the regional uptake, assigning each cortical region a numeric grade of positivity in order to derive a final score. At the end of each reading, a level of confidence was recorded as a score from 0 (negative) to 4 (positive). After the first reading, those cases in which the evaluations by two experienced readers did not match (discordant cases) were independently reevaluated by merging all three visual interpretation criteria.

Results: Good inter-rater agreement was observed for the six readers' confidence level when the three visual reading criteria were applied independently: ICC=0.83 (0.80-0.86) for 18F-florbetapir, ICC=0.84 (0.81-0.87) for 18F-florbetaben, and ICC=0.86 (0.83-0.88) for 18F-flutemetamol reading. Good inter-rater agreement was also observed for the final score: ICC=0.74 (0.70-0.78) for 18F-florbetapir; ICC=0.82 (0.79-0.85) for 18F-florbetaben; ICC=0.84 (0.81-0.87) for 18F-flutemetamol. Intra-rater agreement was good for the final score (from 0.76 to 0.90; P<0.001) and the confidence level (Spearman's rho from 0.89 to 1.00; P<0.001). Disagreement between the two experienced readers was observed in 22 of 252 cases (9%). In a second round of independent reading that merged all the criteria, agreement was reached in 12 of these 22 cases (54%).
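
For readers unfamiliar with the intraclass correlation coefficient (ICC) reported above, the following is a minimal sketch of how one common form can be computed from a scans-by-readers table of scores. The abstract does not state which ICC model was used, so the choice of ICC(2,1) (two-way random effects, absolute agreement, single rater) is an assumption, as are the function name `icc_2_1` and the toy scores, which are hypothetical and not study data.

```python
def icc_2_1(ratings):
    """ICC(2,1): two-way random effects, absolute agreement, single rater.

    ratings: list of rows, one row per scan, one score per reader.
    Assumption: this ICC form is illustrative; the abstract does not
    say which form the authors used.
    """
    n = len(ratings)            # number of scans (subjects)
    k = len(ratings[0])         # number of readers (raters)
    if n < 2 or k < 2 or any(len(row) != k for row in ratings):
        raise ValueError("need a complete table with >=2 scans and >=2 readers")

    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    # Two-way ANOVA sums of squares (one observation per cell).
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)   # between scans
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)   # between readers
    ss_err = ss_total - ss_rows - ss_cols                    # residual

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_err = ss_err / ((n - 1) * (k - 1))

    denom = ms_rows + (k - 1) * ms_err + k * (ms_cols - ms_err) / n
    if denom == 0:
        raise ValueError("ICC undefined: ratings have no variance")
    return (ms_rows - ms_err) / denom


# Hypothetical confidence scores (0-4) for 5 scans read by 3 readers.
toy_scores = [
    [0, 0, 1],
    [4, 4, 4],
    [3, 4, 3],
    [1, 0, 0],
    [2, 3, 2],
]
print(round(icc_2_1(toy_scores), 2))
```

Values close to 1 indicate that most of the score variance comes from differences between scans rather than between readers. Confidence intervals like those reported above would need an additional step that this sketch does not cover.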

Conclusions: All the proposed criteria are useful for grading the positivity or negativity of amyloid deposition, and merging them improves diagnostic confidence and provides better agreement.

MeSH terms

  • Aged
  • Aged, 80 and over
  • Alzheimer Disease / diagnostic imaging*
  • Alzheimer Disease / radiotherapy
  • Amyloid / metabolism*
  • Aniline Compounds / chemistry
  • Benzothiazoles / chemistry
  • Brain
  • Ethylene Glycols / chemistry
  • Fluorine Radioisotopes / chemistry*
  • Fluorine Radioisotopes / pharmacology
  • Humans
  • Image Interpretation, Computer-Assisted
  • Middle Aged
  • Positron Emission Tomography Computed Tomography / methods*
  • Stilbenes / chemistry

Substances

  • Amyloid
  • Aniline Compounds
  • Benzothiazoles
  • Ethylene Glycols
  • Fluorine Radioisotopes
  • Stilbenes
  • flutemetamol
  • florbetapir
  • Fluorine-18
  • 4-(N-methylamino)-4'-(2-(2-(2-fluoroethoxy)ethoxy)ethoxy)stilbene