Denoising diffusion-based MRI to CT image translation enables automated spinal segmentation

Eur Radiol Exp. 2023 Nov 14;7(1):70. doi: 10.1186/s41747-023-00385-2.

Abstract

Background: Automated segmentation of spinal magnetic resonance imaging (MRI) plays a vital role both scientifically and clinically. However, accurately delineating posterior spine structures is challenging.

Methods: In this retrospective study, approved by the ethics committee, T1-weighted and T2-weighted MR images were translated into computed tomography (CT) images for a total of 263 CT/MR series pairs. Landmark-based registration was performed to align the image pairs. We compared two-dimensional (2D) paired approaches (Pix2Pix; denoising diffusion implicit models (DDIM) in image mode and in noise mode) with unpaired approaches (SynDiff; contrastive unpaired translation), using the peak signal-to-noise ratio as the quality measure. A publicly available segmentation network segmented the synthesized CT datasets, and Dice similarity coefficients (DSC) were evaluated on in-house test sets and the "MRSpineSeg Challenge" volumes. The 2D findings were extended to three-dimensional (3D) Pix2Pix and DDIM.
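The abstract distinguishes a DDIM "image mode" from a "noise mode"; the difference lies in what the denoising network predicts at each sampling step. The NumPy sketch below shows one deterministic DDIM update (eta = 0) under both parameterizations. It is an illustrative sketch, not the authors' code, and all function and variable names are assumptions.

```python
# Minimal sketch (not the authors' implementation) of one deterministic DDIM
# sampling step, showing the two output parameterizations named in the
# abstract: "noise mode" (the network predicts the added noise) and
# "image mode" (the network predicts the clean target image directly).
import numpy as np

def ddim_step(x_t, model_out, abar_t, abar_prev, mode="noise"):
    """One DDIM update with eta = 0 (fully deterministic).

    x_t       : current noisy sample at timestep t
    model_out : network prediction (noise or clean image, depending on mode)
    abar_t    : cumulative noise-schedule value alpha_bar at timestep t
    abar_prev : alpha_bar at the previous (less noisy) timestep
    """
    if mode == "noise":
        # Noise mode: recover the clean-image estimate from the predicted noise.
        eps = model_out
        x0_pred = (x_t - np.sqrt(1.0 - abar_t) * eps) / np.sqrt(abar_t)
    else:
        # Image mode: the network outputs the clean image; infer the implied noise.
        x0_pred = model_out
        eps = (x_t - np.sqrt(abar_t) * x0_pred) / np.sqrt(1.0 - abar_t)
    # Deterministic DDIM transition toward the previous timestep.
    return np.sqrt(abar_prev) * x0_pred + np.sqrt(1.0 - abar_prev) * eps

# Toy usage with random arrays standing in for a trained MRI-to-CT network.
x_t = np.random.randn(1, 128, 128)
pred_noise = np.random.randn(1, 128, 128)   # placeholder for a UNet output
x_prev = ddim_step(x_t, pred_noise, abar_t=0.5, abar_prev=0.7, mode="noise")
```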

Results: The 2D paired methods and SynDiff exhibited similar translation performance and DSC on paired data. DDIM image mode achieved the highest image quality. SynDiff, Pix2Pix, and DDIM image mode demonstrated similar DSC (0.77). To correct rotations about the craniocaudal axis, at least two landmarks per vertebra were required for registration. The 3D translation outperformed the 2D approach, yielding a higher DSC (0.80) and anatomically accurate segmentations at a higher spatial resolution than that of the original MRI series.
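For reference, the reported DSC values (0.77 and 0.80) follow the standard per-label Dice overlap. The sketch below is a hedged illustration of that computation; the function name and label scheme are assumptions, not taken from the paper.

```python
# Illustrative per-label Dice similarity coefficient (DSC), as typically used
# to score synthesized-CT segmentations against reference masks.
import numpy as np

def dice_coefficient(pred, ref, label):
    """DSC for one label: 2*|A intersect B| / (|A| + |B|)."""
    a = (pred == label)
    b = (ref == label)
    denom = a.sum() + b.sum()
    if denom == 0:
        return np.nan  # label absent in both volumes
    return 2.0 * np.logical_and(a, b).sum() / denom

# Toy example: two small label volumes with vertebra labels 1..3.
pred = np.random.randint(0, 4, size=(32, 32, 32))
ref = np.random.randint(0, 4, size=(32, 32, 32))
scores = {lbl: dice_coefficient(pred, ref, lbl) for lbl in (1, 2, 3)}
print(scores)
```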

Conclusions: Registration with two landmarks per vertebra enabled paired image-to-image translation from MRI to CT, which outperformed all unpaired approaches. The 3D techniques provided anatomically correct segmentations and avoided underprediction of small structures such as the spinous process.
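As a hedged illustration of landmark-based registration with paired landmarks (at least two per vertebra), the sketch below fits a least-squares rigid transform (Kabsch algorithm) mapping MR landmark coordinates onto their CT counterparts. It is a generic global rigid fit under assumed names, not the authors' registration pipeline.

```python
# Sketch (not the authors' pipeline) of landmark-based rigid registration:
# with paired landmarks, a least-squares rotation and translation can align
# MR landmark coordinates to their CT counterparts.
import numpy as np

def rigid_fit(src, dst):
    """Least-squares rigid transform (R, t) mapping src (N,3) onto dst (N,3)."""
    src_c, dst_c = src.mean(axis=0), dst.mean(axis=0)
    h = (src - src_c).T @ (dst - dst_c)           # cross-covariance matrix
    u, _, vt = np.linalg.svd(h)
    d = np.sign(np.linalg.det(vt.T @ u.T))        # guard against reflections
    rot = vt.T @ np.diag([1.0, 1.0, d]) @ u.T
    trans = dst_c - rot @ src_c
    return rot, trans

# Toy example: two landmarks per vertebra for three vertebrae (6 points),
# related by a known rotation and translation.
mr_pts = np.random.rand(6, 3) * 100.0
true_rot, _ = np.linalg.qr(np.random.randn(3, 3))
if np.linalg.det(true_rot) < 0:
    true_rot[:, 0] *= -1                          # keep a proper rotation
ct_pts = mr_pts @ true_rot.T + np.array([5.0, -2.0, 10.0])
rot, trans = rigid_fit(mr_pts, ct_pts)
print(np.allclose(mr_pts @ rot.T + trans, ct_pts, atol=1e-6))
```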

Relevance statement: This study addresses the unresolved issue of translating spinal MRI to CT, making CT-based tools usable for MRI data. It generates whole-spine segmentations, previously unavailable from MRI, which are a prerequisite for biomechanical modeling and feature extraction for clinical applications.

Key points: • Unpaired image translation falls short in converting spine MRI to CT effectively. • Paired translation requires registration with at least two landmarks per vertebra. • Paired image-to-image translation enables segmentation transfer to other domains. • 3D translation enables super-resolution from MRI to CT. • 3D translation prevents underprediction of small structures.

Keywords: Deep learning; Image processing (computer assisted); Magnetic resonance imaging; Spine; Vertebral body.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Image Processing, Computer-Assisted* / methods
  • Magnetic Resonance Imaging / methods
  • Retrospective Studies
  • Spine / diagnostic imaging
  • Tomography, X-Ray Computed* / methods