STatistically Assigned Response Criteria in Solid Tumors (STARCIST)

Thomas Bengtsson; Sandra M Sanabria-Bohorquez; Timothy J McCarthy; David S Binns; Rodney J Hicks; Alex J de Crespigny

doi:10.1186/s40644-015-0042-4

STatistically Assigned Response Criteria in Solid Tumors (STARCIST)

Cancer Imaging. 2015 Jul 31;15(1):9. doi: 10.1186/s40644-015-0042-4.

Authors

Thomas Bengtsson¹, Sandra M Sanabria-Bohorquez², Timothy J McCarthy³, David S Binns⁴, Rodney J Hicks^{5

6}, Alex J de Crespigny⁷

Affiliations

¹ Biostatistics, Genentech Inc, 1 DNA Way, South San Francisco, CA, 94080, USA. thomasgb@gene.com.
² Clinical Imaging, Genentech Inc, South San Francisco, CA, USA. sanabria.sandra@gene.com.
³ Clinical Imaging, Pfizer Global R&D, Groton, CT, USA. timothy.j.mccarthy@pfizer.com.
⁴ The Sir Peter MacCallum Department of Oncology, the University of Melbourne, Parkville, VIC, Australia. David.Binns@petermac.org.
⁵ The Sir Peter MacCallum Department of Oncology, the University of Melbourne, Parkville, VIC, Australia. Rod.Hicks@petermac.org.
⁶ Cancer Imaging, the Peter MacCallum Cancer Centre, East Melbourne, VIC, Australia. Rod.Hicks@petermac.org.
⁷ Clinical Imaging, Genentech Inc, South San Francisco, CA, USA. decrespigny.alex@gene.com.

Abstract

Background: Several reproducibility studies have established good test-retest reliability of FDG-PET in various oncology settings. However, these studies are based on relatively short inter-scan periods of 1-3 days while, in contrast, response assessments based on FDG-PET in early phase drug trials are typically made over an interval of 2-3 weeks during the first treatment cycle. With focus on longer, on-treatment scan intervals, we develop a data-driven approach to calculate baseline-specific cutoff values to determine patient-level changes in glucose uptake that are unlikely to be explained by random variability. Our method takes into account the statistical nature of natural fluctuations in SUV as well as potential bias effects.

Methods: To assess variability in SUV over clinically relevant scan intervals for clinical trials, we analyzed baseline and follow-up FDG-PET scans with a median scan interval of 21 days from 53 advanced stage cancer patients enrolled in a Phase 1 trial. The 53 patients received a sub-pharmacologic drug dose and the trial data is treated as a 'test-retest' data set. A simulation-based tool is presented which takes as input baseline lesion SUVmax values, the variance of spurious changes in SUVmax between scans, the desired Type I error rate, and outputs lesion and patient based cut-off values. Bias corrections are included to account for variations in tracer uptake time.

Results: In the training data, changes in SUVmax follow an approximately zero-mean Gaussian distribution with constant variance across levels of the baseline measurements. Because of constant variance, the coefficient of variation is a decreasing function of the magnitude of baseline SUVmax. This finding is consistent with published results, but our data shows greater variability. Application of our method to NSCLC patients treated with erlotinib produces results distinct from those based on the EORTC criteria. Based on data presented here as well as previous repeatability studies, the proposed method has greater statistical power to detect a significant %-decrease on SUVmax compared to published criteria relying on symmetric thresholds.

Conclusions: Defining patient-specific, baseline dependent cut-off values based on the (null) distribution of naturally occurring fluctuations in glucose uptake enable identification of statistically significant changes in SUVmax. For lower baseline values, the produced cutoff values are notably asymmetric with relatively large changes (e.g. >50 %) required for statistical significance. For use with prospectively defined endpoints, the developed method enables the use of one-armed trials to detect pharmacodynamic drug effects based on FDG-PET. The clinical importance of changes in SUVmax is likely to remain dependent on both tumor biology and the type of treatment.

MeSH terms

Algorithms*
Biomarkers, Pharmacological
Carcinoma, Non-Small-Cell Lung / drug therapy
Fluorodeoxyglucose F18
Glucose / metabolism
Humans
Lung Neoplasms / drug therapy
Neoplasms / metabolism
Neoplasms / therapy*
Normal Distribution
Positron-Emission Tomography / standards*
Predictive Value of Tests
Radiopharmaceuticals
Tomography, X-Ray Computed / standards*
Treatment Outcome

Substances

Biomarkers, Pharmacological
Radiopharmaceuticals
Fluorodeoxyglucose F18
Glucose