Error detection model developed using a multi-task convolutional neural network in patient-specific quality assurance for volumetric-modulated arc therapy

Yuto Kimura; Noriyuki Kadoya; Yohei Oku; Tomohiro Kajikawa; Seiji Tomori; Keiichi Jingu

doi:10.1002/mp.15031

Med Phys. 2021 Sep;48(9):4769-4783. doi: 10.1002/mp.15031. Epub 2021 Jul 29.

Authors

Yuto Kimura¹,², Noriyuki Kadoya¹, Yohei Oku², Tomohiro Kajikawa¹,³, Seiji Tomori¹,⁴, Keiichi Jingu¹

Affiliations

¹ Department of Radiation Oncology, Tohoku University Graduate School of Medicine, Sendai, Japan.
² Radiation Oncology Center, Ofuna Chuo Hospital, Kamakura, Japan.
³ Department of Radiology, Graduate School of Medical Science, Kyoto Prefectural University of Medicine, Kyoto, Japan.
⁴ Department of Radiology, National Hospital Organization Sendai Medical Center, Sendai, Japan.

PMID: 34101848
DOI: 10.1002/mp.15031

Abstract

Purpose: In patient-specific quality assurance (QA) for static beam intensity-modulated radiation therapy (IMRT), machine-learning-based dose analysis methods have been developed to identify the cause of an error as an alternative to gamma analysis. Although these new methods have revealed that the cause of the error can be identified by analyzing the dose distribution obtained from the two-dimensional detector, they have not been extended to the analysis of volumetric-modulated arc therapy (VMAT) QA. In this study, we propose a deep learning approach to detect various types of errors in patient-specific VMAT QA.

Methods: A total of 161 beams from 104 prostate VMAT plans were analyzed. All beams were measured using a cylindrical detector (Delta4; ScandiDos, Uppsala, Sweden), and predicted dose distributions in a cylindrical phantom were calculated using a treatment planning system (TPS). In addition to the error-free plan, we simulated 12 types of errors: two types of multileaf collimator positional errors (systematic or random leaf offset of 2 mm), two types of monitor unit (MU) scaling errors (±3%), two types of gantry rotation errors (±2° in the clockwise and counterclockwise directions), and six types of phantom setup errors (±1 mm in the lateral, longitudinal, and vertical directions). The error-introduced predicted dose distributions were created by editing the TPS-calculated dose distributions with in-house software. These 13 types of dose difference maps, consisting of an error-free map and 12 error maps, were created from the measured and predicted dose distributions and were used to train the convolutional neural network (CNN) model. Our model was a multi-task model that individually detected each of the 12 types of errors. Two datasets, Test sets 1 and 2, were prepared to evaluate the performance of the model. Test set 1 consisted of the same 13 types of dose maps as those used for training, whereas Test set 2 included dose maps with 25 types of errors in addition to the error-free dose map. Each of these 25 error types was generated by combining two of the 12 simulated error types. For comparison with the performance of our model, gamma analysis was performed for Test sets 1 and 2 with the criteria set to 3%/2 mm and 2%/1 mm (dose difference/distance to agreement).
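For readers unfamiliar with the two quantities named in the Methods, the following is a minimal sketch of a dose difference map and a gamma pass rate on a flat two-dimensional grid. It is not the authors' software or the detector vendor's implementation. The function names, the global normalization to the maximum measured dose, the 10% low-dose cutoff, and the brute-force pixel search without interpolation are all assumptions made for illustration.

```python
import math


def dose_difference_map(measured, predicted):
    """Point-by-point difference (measured minus predicted) of two 2D grids."""
    return [[m - p for m, p in zip(m_row, p_row)]
            for m_row, p_row in zip(measured, predicted)]


def gamma_pass_rate(measured, predicted, spacing_mm, dose_pct, dta_mm,
                    low_dose_cut=0.1):
    """Fraction of measured points with gamma <= 1 (illustrative only).

    Assumptions: global normalization to the maximum measured dose,
    no interpolation between grid points, points below
    low_dose_cut * max dose are excluded.
    """
    rows, cols = len(measured), len(measured[0])
    d_norm = max(max(row) for row in measured)
    dose_tol = dose_pct / 100.0 * d_norm
    # Points farther away than the DTA can never give gamma <= 1,
    # so the search window only needs to cover the DTA radius.
    reach = math.ceil(dta_mm / spacing_mm)
    passed = 0
    evaluated = 0
    for i in range(rows):
        for j in range(cols):
            if measured[i][j] < low_dose_cut * d_norm:
                continue
            best = math.inf
            for k in range(max(0, i - reach), min(rows, i + reach + 1)):
                for l in range(max(0, j - reach), min(cols, j + reach + 1)):
                    dist = spacing_mm * math.hypot(k - i, l - j)
                    diff = predicted[k][l] - measured[i][j]
                    g2 = (dist / dta_mm) ** 2 + (diff / dose_tol) ** 2
                    best = min(best, g2)
            evaluated += 1
            if best <= 1.0:
                passed += 1
    if evaluated == 0:
        raise ValueError("no points above the low-dose cutoff")
    return passed / evaluated


# Hypothetical uniform 5 x 5 field with a simulated +5% MU scaling error.
measured = [[1.0] * 5 for _ in range(5)]
scaled = [[1.05] * 5 for _ in range(5)]
print(gamma_pass_rate(measured, scaled, spacing_mm=1.0, dose_pct=3.0, dta_mm=2.0))
```

A pass rate of this kind summarizes a whole map in one number, which is why it can flag that a plan fails without indicating which kind of error caused the failure.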

Results: For Test set 1, the overall accuracy of our CNN model, gamma analysis with the criteria set to 3%/2 mm, and gamma analysis with the criteria set to 2%/1 mm was 0.92, 0.19, and 0.81, respectively. Similarly, for Test set 2, the overall accuracy was 0.44, 0.42, and 0.95, respectively. Our model outperformed gamma analysis in classifying dose maps containing a single error type, but it was inferior to gamma analysis with the 2%/1 mm criteria (0.44 vs. 0.95) in classifying dose maps containing compound errors.
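The abstract does not state how the multi-task outputs were converted into an overall accuracy, so the following is only one plausible reading, sketched under stated assumptions. It treats the model as producing one probability per error type, applies a 0.5 threshold, and counts a dose map as correct only when the predicted set of errors matches the true set exactly. The label names, the threshold, and the exact-match rule are all hypothetical.

```python
# Hypothetical labels for the 12 simulated error types listed in the Methods.
ERROR_TYPES = [
    "mlc_systematic", "mlc_random",
    "mu_plus", "mu_minus",
    "gantry_cw", "gantry_ccw",
    "setup_lat_plus", "setup_lat_minus",
    "setup_lng_plus", "setup_lng_minus",
    "setup_vrt_plus", "setup_vrt_minus",
]


def encode(errors):
    """Turn a set of error names into a 12-element 0/1 target vector."""
    return [1 if name in errors else 0 for name in ERROR_TYPES]


def decode(probabilities, threshold=0.5):
    """Turn 12 per-task probabilities into the set of detected errors."""
    return {name for name, p in zip(ERROR_TYPES, probabilities)
            if p >= threshold}


def exact_match_accuracy(true_sets, predicted_sets):
    """Fraction of maps whose predicted error set equals the true set."""
    hits = sum(1 for t, p in zip(true_sets, predicted_sets) if t == p)
    return hits / len(true_sets)


# An error-free map is all zeros; a compound error sets two entries.
print(encode(set()))
print(encode({"mu_plus", "gantry_cw"}))
```

Under an exact-match rule like this, a compound-error map counts as wrong if either of its two errors is missed, which would make Test set 2 a stricter test than Test set 1.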

Conclusions: A multi-task CNN model for detecting errors in patient-specific VMAT QA using a cylindrical measuring device was constructed, and its performance was evaluated. Our results demonstrate that our model was effective in identifying the error type in the dose map for VMAT QA.

Keywords: convolutional neural network; deep learning; patient-specific QA; radiotherapy; volumetric-modulated radiation therapy.

MeSH terms

Humans
Machine Learning
Male
Neural Networks, Computer
Phantoms, Imaging
Quality Assurance, Health Care
Radiotherapy Dosage
Radiotherapy Planning, Computer-Assisted
Radiotherapy, Intensity-Modulated*