Automated segmentation of an intensity calibration phantom in clinical CT images using a convolutional neural network

Keisuke Uemura; Yoshito Otake; Masaki Takao; Mazen Soufi; Akihiro Kawasaki; Nobuhiko Sugano; Yoshinobu Sato

doi:10.1007/s11548-021-02345-w

Automated segmentation of an intensity calibration phantom in clinical CT images using a convolutional neural network

Int J Comput Assist Radiol Surg. 2021 Nov;16(11):1855-1864. doi: 10.1007/s11548-021-02345-w. Epub 2021 Mar 17.

Authors

Keisuke Uemura^{1

2}, Yoshito Otake³, Masaki Takao⁴, Mazen Soufi³, Akihiro Kawasaki³, Nobuhiko Sugano⁵, Yoshinobu Sato³

Affiliations

¹ Division of Information Science, Graduate School of Science and Technology, Nara Institute of Science and Technology, Ikoma, Nara, Japan. keisuke-uemura@is.naist.jp.
² Department of Orthopaedic Medical Engineering, Osaka University Graduate School of Medicine, Suita, Osaka, Japan. keisuke-uemura@is.naist.jp.
³ Division of Information Science, Graduate School of Science and Technology, Nara Institute of Science and Technology, Ikoma, Nara, Japan.
⁴ Department of Orthopaedics, Osaka University Graduate School of Medicine, Suita, Osaka, Japan.
⁵ Department of Orthopaedic Medical Engineering, Osaka University Graduate School of Medicine, Suita, Osaka, Japan.

PMID: 33730352
DOI: 10.1007/s11548-021-02345-w

Abstract

Purpose: In quantitative computed tomography (CT), manual selection of the intensity calibration phantom's region of interest is necessary for calculating density (mg/cm³) from the radiodensity values (Hounsfield units: HU). However, as this manual process requires effort and time, the purposes of this study were to develop a system that applies a convolutional neural network (CNN) to automatically segment intensity calibration phantom regions in CT images and to test the system in a large cohort to evaluate its robustness.

Methods: This cross-sectional, retrospective study included 1040 cases (520 each from two institutions) in which an intensity calibration phantom (B-MAS200, Kyoto Kagaku, Kyoto, Japan) was used. A training dataset was created by manually segmenting the phantom regions for 40 cases (20 cases for each institution). The CNN model's segmentation accuracy was assessed with the Dice coefficient, and the average symmetric surface distance was assessed through fourfold cross-validation. Further, absolute difference of HU was compared between manually and automatically segmented regions. The system was tested on the remaining 1000 cases. For each institution, linear regression was applied to calculate the correlation coefficients between HU and phantom density.

Results: The source code and the model used for phantom segmentation can be accessed at https://github.com/keisuke-uemura/CT-Intensity-Calibration-Phantom-Segmentation . The median Dice coefficient was 0.977, and the median average symmetric surface distance was 0.116 mm. The median absolute difference of the segmented regions between manual and automated segmentation was 0.114 HU. For the test cases, the median correlation coefficients were 0.9998 and 0.999 for the two institutions, with a minimum value of 0.9863.

Conclusion: The proposed CNN model successfully segmented the calibration phantom regions in CT images with excellent accuracy.

Keywords: Artificial intelligence; Bone mineral density; Deep learning; Phantom segmentation; Quantitative computed tomography; U-net.

MeSH terms

Calibration
Cross-Sectional Studies
Humans
Image Processing, Computer-Assisted
Neural Networks, Computer*
Retrospective Studies
Tomography, X-Ray Computed*

Abstract

MeSH terms

Grants and funding