Deep learning based time-to-event analysis with PET, CT and joint PET/CT for head and neck cancer prognosis

Comput Methods Programs Biomed. 2022 Jul:222:106948. doi: 10.1016/j.cmpb.2022.106948. Epub 2022 Jun 9.

Abstract

Objectives: Recent studies have shown that deep learning based on pre-treatment positron emission tomography (PET) or computed tomography (CT) is promising for distant metastasis (DM) and overall survival (OS) prognosis in head and neck cancer (HNC). However, lesion segmentation is typically required, resulting in a predictive power susceptible to variations in primary and lymph node gross tumor volume (GTV) segmentation. This study aimed at achieving prognosis without GTV segmentation, and extending single modality prognosis to joint PET/CT to allow investigating the predictive performance of combined- compared to single-modality inputs.

Methods: We employed a 3D-Resnet combined with a time-to-event outcome model to incorporate censoring information. We focused on the prognosis of DM and OS for HNC patients. For each clinical endpoint, five models with PET and/or CT images as input were compared: PET-GTV, PET-only, CT-GTV, CT-only, and PET/CT-GTV models, where -GTV indicates that the corresponding images were masked using the GTV contour. Publicly available delineated CT and PET scans from 4 different Canadian hospitals (293) and the MAASTRO clinic (74) were used for training by 3-fold cross-validation (CV). For independent testing, we used 110 patients from a collaborating institution. The predictive performance was evaluated via Harrell's Concordance Index (HCI) and Kaplan-Meier curves.

Results: In a 5-year time-to-event analysis, all models could produce CV HCIs with median values around 0.8 for DM and 0.7 for OS. The best performance was obtained with the PET-only model, achieving a median testing HCI of 0.82 for DM and 0.69 for OS. Compared with the PET/CT-GTV model, the PET-only still had advantages of up to 0.07 in terms of testing HCI. The Kaplan-Meier curves and corresponding log-rank test results also demonstrated significant stratification capability of our models for the testing cohort.

Conclusion: Deep learning-based DM and OS time-to-event models showed predictive capability and could provide indications for personalized RT. The best predictive performance achieved by the PET-only model suggested GTV segmentation might be less relevant for PET-based prognosis.

Keywords: Deep-learning; Head-and-neck cancer; PET/CT.

MeSH terms

  • Canada
  • Deep Learning*
  • Fluorodeoxyglucose F18
  • Head and Neck Neoplasms* / diagnostic imaging
  • Humans
  • Positron Emission Tomography Computed Tomography
  • Positron-Emission Tomography / methods
  • Prognosis
  • Radiopharmaceuticals
  • Tomography, X-Ray Computed / methods

Substances

  • Radiopharmaceuticals
  • Fluorodeoxyglucose F18