TCU-Net: Transformer Embedded in Convolutional U-Shaped Network for Retinal Vessel Segmentation

Zidi Shi; Yu Li; Hua Zou; Xuedong Zhang

doi:10.3390/s23104897

Sensors (Basel). 2023 May 19;23(10):4897. doi: 10.3390/s23104897.

Authors

Zidi Shi¹, Yu Li¹, Hua Zou², Xuedong Zhang³

Affiliations

¹ School of Electronic and Electrical Engineering, Wuhan Textile University, Wuhan 430077, China.
² School of Computer Science, Wuhan University, Wuhan 430072, China.
³ School of Information Engineering, Tarim University, Alaer 843300, China.

Abstract

Optical coherence tomography angiography (OCTA) provides a detailed visualization of the vascular system to aid in the detection and diagnosis of ophthalmic disease. However, accurately extracting microvascular details from OCTA images remains challenging because of the limitations of pure convolutional networks. We propose a novel end-to-end transformer-based network architecture, TCU-Net, for OCTA retinal vessel segmentation. To address the loss of vascular features caused by convolutional operations, an efficient cross-fusion transformer module replaces the original skip connections of U-Net. The transformer module interacts with the encoder's multiscale vascular features to enrich vascular information while keeping computational complexity linear. Additionally, we design an efficient channel-wise cross-attention module that fuses the multiscale features with the fine-grained details from the decoding stages, resolving the semantic bias between them and enhancing effective vascular information. The model was evaluated on the dedicated Retinal OCTA Segmentation (ROSE) dataset. On the ROSE-1 dataset, TCU-Net achieves accuracy values of 0.9230, 0.9912, and 0.9042 on the superficial vascular complex (SVC), deep vascular complex (DVC), and SVC+DVC images, respectively, with corresponding AUC values of 0.9512, 0.9823, and 0.9170. On the ROSE-2 dataset, the accuracy and AUC are 0.9454 and 0.8623, respectively. The experiments demonstrate that TCU-Net outperforms state-of-the-art approaches in vessel segmentation performance and robustness.
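
The accuracy and AUC values reported above are standard metrics for binary vessel segmentation, presumably computed per pixel. As a minimal sketch of how such values are commonly obtained (this is not the authors' evaluation code; the function names, the toy data, and the 0.5 threshold are illustrative assumptions), accuracy compares a thresholded prediction with the ground-truth mask, while AUC can be computed from the raw scores as the probability that a vessel pixel is ranked above a background pixel:

```python
def pixel_accuracy(pred_mask, true_mask):
    """Fraction of pixels whose predicted label equals the ground-truth label."""
    if len(pred_mask) != len(true_mask):
        raise ValueError("masks must have the same number of pixels")
    matches = sum(p == t for p, t in zip(pred_mask, true_mask))
    return matches / len(true_mask)


def roc_auc(scores, true_mask):
    """AUC via the rank (Mann-Whitney) formulation.

    Equals the probability that a randomly chosen vessel pixel receives a
    higher score than a randomly chosen background pixel; ties count as 0.5.
    This O(P * N) loop is only suitable for small illustrative inputs.
    """
    pos = [s for s, t in zip(scores, true_mask) if t == 1]
    neg = [s for s, t in zip(scores, true_mask) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one vessel and one background pixel")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Hypothetical toy example: a flattened 2x4 image (1 = vessel, 0 = background).
true_mask = [1, 0, 0, 1, 0, 1, 0, 0]
scores = [0.9, 0.2, 0.6, 0.7, 0.1, 0.4, 0.3, 0.5]  # assumed model outputs

# Assumed decision threshold of 0.5 to turn scores into a binary mask.
pred_mask = [1 if s >= 0.5 else 0 for s in scores]

accuracy = pixel_accuracy(pred_mask, true_mask)
auc = roc_auc(scores, true_mask)
print(f"accuracy={accuracy:.4f}  auc={auc:.4f}")
```

Accuracy depends on the chosen threshold, whereas AUC is threshold-free. Because background pixels dominate vessel images, the two metrics can diverge, which is one reason both are usually reported together.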

Keywords: TCU-Net; channel cross-attention; efficient cross-scale transformer; retinal vessel segmentation.

MeSH terms

Angiography
Retinal Vessels* / diagnostic imaging
Tomography, Optical Coherence
