Training of deep cross-modality conversion models with a small data set, and their application in megavoltage CT to kilovoltage CT conversion

Sho Ozaki; Shizuo Kaji; Kanabu Nawa; Toshikazu Imae; Atsushi Aoki; Takahiro Nakamoto; Takeshi Ohta; Yuki Nozawa; Hideomi Yamashita; Akihiro Haga; Keiichi Nakagawa

doi:10.1002/mp.15626

Training of deep cross-modality conversion models with a small data set, and their application in megavoltage CT to kilovoltage CT conversion

Med Phys. 2022 Jun;49(6):3769-3782. doi: 10.1002/mp.15626. Epub 2022 Apr 17.

Authors

Affiliations

¹ Graduate School of Medicine, University of Tokyo, Tokyo, Japan.
² Institute of Mathematics for Industry, Kyushu University, Motooka, Nishi-ku, Fukuoka, Japan.
³ Department of Radiology, University of Tokyo Hospital, Tokyo, Japan.
⁴ Department of Biological Science and Engineering, Faculty of Health Sciences, Hokkaido University, Kita-ku, Sapporo, Hokkaido, Japan.
⁵ Graduate School of Biomedical Science, Tokushima University, Tokushima, Japan.

PMID: 35315529
DOI: 10.1002/mp.15626

Abstract

Purpose: In recent years, deep learning-based image processing has emerged as a valuable tool for medical imaging owing to its high performance. However, the quality of deep learning-based methods heavily relies on the amount of training data; the high cost of acquiring a large data set is a limitation to their utilization in medical fields. Herein, based on deep learning, we developed a computed tomography (CT) modality conversion method requiring only a few unsupervised images.

Methods: The proposed method is based on cycle-consistency generative adversarial network (CycleGAN) with several extensions tailored for CT images, which aims at preserving the structure in the processed images and reducing the amount of training data. This method was applied to realize the conversion of megavoltage computed tomography (MVCT) to kilovoltage computed tomography (kVCT) images. Training was conducted using several data sets acquired from patients with head and neck cancer. The size of the data sets ranged from 16 slices (two patients) to 2745 slices (137 patients) for MVCT and 2824 slices (98 patients) for kVCT.

Results: The required size of the training data was found to be as small as a few hundred slices. By statistical and visual evaluations, the quality improvement and structure preservation of the MVCT images converted by the proposed model were investigated. As a clinical benefit, it was observed by medical doctors that the converted images enhanced the precision of contouring.

Conclusions: We developed an MVCT to kVCT conversion model based on deep learning, which can be trained using only a few hundred unpaired images. The stability of the model against changes in data size was demonstrated. This study promotes the reliable use of deep learning in clinical medicine by partially answering commonly asked questions, such as "Is our data sufficient?" and "How much data should we acquire?"

Keywords: computed tomography; cross-modality conversion; deep learning; training data reduction.

MeSH terms

Cone-Beam Computed Tomography
Head and Neck Neoplasms*
Humans
Image Processing, Computer-Assisted / methods
Radiotherapy Planning, Computer-Assisted* / methods
Tomography, X-Ray Computed / methods

Abstract

MeSH terms

Grants and funding