Automated segmentation of an intensity calibration phantom in clinical CT images using a convolutional neural network

Int J Comput Assist Radiol Surg. 2021 Nov;16(11):1855-1864. doi: 10.1007/s11548-021-02345-w. Epub 2021 Mar 17.

Abstract

Purpose: In quantitative computed tomography (CT), manual selection of the intensity calibration phantom's region of interest is necessary for calculating density (mg/cm3) from the radiodensity values (Hounsfield units: HU). However, as this manual process requires effort and time, the purposes of this study were to develop a system that applies a convolutional neural network (CNN) to automatically segment intensity calibration phantom regions in CT images and to test the system in a large cohort to evaluate its robustness.

Methods: This cross-sectional, retrospective study included 1040 cases (520 each from two institutions) in which an intensity calibration phantom (B-MAS200, Kyoto Kagaku, Kyoto, Japan) was used. A training dataset was created by manually segmenting the phantom regions for 40 cases (20 cases for each institution). The CNN model's segmentation accuracy was assessed with the Dice coefficient, and the average symmetric surface distance was assessed through fourfold cross-validation. Further, absolute difference of HU was compared between manually and automatically segmented regions. The system was tested on the remaining 1000 cases. For each institution, linear regression was applied to calculate the correlation coefficients between HU and phantom density.

Results: The source code and the model used for phantom segmentation can be accessed at https://github.com/keisuke-uemura/CT-Intensity-Calibration-Phantom-Segmentation . The median Dice coefficient was 0.977, and the median average symmetric surface distance was 0.116 mm. The median absolute difference of the segmented regions between manual and automated segmentation was 0.114 HU. For the test cases, the median correlation coefficients were 0.9998 and 0.999 for the two institutions, with a minimum value of 0.9863.

Conclusion: The proposed CNN model successfully segmented the calibration phantom regions in CT images with excellent accuracy.

Keywords: Artificial intelligence; Bone mineral density; Deep learning; Phantom segmentation; Quantitative computed tomography; U-net.

MeSH terms

  • Calibration
  • Cross-Sectional Studies
  • Humans
  • Image Processing, Computer-Assisted
  • Neural Networks, Computer*
  • Retrospective Studies
  • Tomography, X-Ray Computed*