STatistically Assigned Response Criteria in Solid Tumors (STARCIST)

Cancer Imaging. 2015 Jul 31;15(1):9. doi: 10.1186/s40644-015-0042-4.

Abstract

Background: Several reproducibility studies have established good test-retest reliability of FDG-PET in various oncology settings. However, these studies are based on relatively short inter-scan periods of 1-3 days while, in contrast, response assessments based on FDG-PET in early phase drug trials are typically made over an interval of 2-3 weeks during the first treatment cycle. With focus on longer, on-treatment scan intervals, we develop a data-driven approach to calculate baseline-specific cutoff values to determine patient-level changes in glucose uptake that are unlikely to be explained by random variability. Our method takes into account the statistical nature of natural fluctuations in SUV as well as potential bias effects.

Methods: To assess variability in SUV over clinically relevant scan intervals for clinical trials, we analyzed baseline and follow-up FDG-PET scans with a median scan interval of 21 days from 53 advanced stage cancer patients enrolled in a Phase 1 trial. The 53 patients received a sub-pharmacologic drug dose and the trial data is treated as a 'test-retest' data set. A simulation-based tool is presented which takes as input baseline lesion SUVmax values, the variance of spurious changes in SUVmax between scans, the desired Type I error rate, and outputs lesion and patient based cut-off values. Bias corrections are included to account for variations in tracer uptake time.

Results: In the training data, changes in SUVmax follow an approximately zero-mean Gaussian distribution with constant variance across levels of the baseline measurements. Because of constant variance, the coefficient of variation is a decreasing function of the magnitude of baseline SUVmax. This finding is consistent with published results, but our data shows greater variability. Application of our method to NSCLC patients treated with erlotinib produces results distinct from those based on the EORTC criteria. Based on data presented here as well as previous repeatability studies, the proposed method has greater statistical power to detect a significant %-decrease on SUVmax compared to published criteria relying on symmetric thresholds.

Conclusions: Defining patient-specific, baseline dependent cut-off values based on the (null) distribution of naturally occurring fluctuations in glucose uptake enable identification of statistically significant changes in SUVmax. For lower baseline values, the produced cutoff values are notably asymmetric with relatively large changes (e.g. >50 %) required for statistical significance. For use with prospectively defined endpoints, the developed method enables the use of one-armed trials to detect pharmacodynamic drug effects based on FDG-PET. The clinical importance of changes in SUVmax is likely to remain dependent on both tumor biology and the type of treatment.

MeSH terms

  • Algorithms*
  • Biomarkers, Pharmacological
  • Carcinoma, Non-Small-Cell Lung / drug therapy
  • Fluorodeoxyglucose F18
  • Glucose / metabolism
  • Humans
  • Lung Neoplasms / drug therapy
  • Neoplasms / metabolism
  • Neoplasms / therapy*
  • Normal Distribution
  • Positron-Emission Tomography / standards*
  • Predictive Value of Tests
  • Radiopharmaceuticals
  • Tomography, X-Ray Computed / standards*
  • Treatment Outcome

Substances

  • Biomarkers, Pharmacological
  • Radiopharmaceuticals
  • Fluorodeoxyglucose F18
  • Glucose