TiM-Net: Transformer in M-Net for Retinal Vessel Segmentation

Hongbin Zhang; Xiang Zhong; Zhijie Li; Yanan Chen; Zhiliang Zhu; Jingqin Lv; Chuanxiu Li; Ying Zhou; Guangli Li

doi:10.1155/2022/9016401

J Healthc Eng. 2022 Jul 11:2022:9016401. doi: 10.1155/2022/9016401. eCollection 2022.

Authors

Hongbin Zhang¹, Xiang Zhong¹, Zhijie Li¹, Yanan Chen², Zhiliang Zhu¹, Jingqin Lv¹, Chuanxiu Li³, Ying Zhou⁴, Guangli Li³

Affiliations

¹ School of Software, East China Jiaotong University, Nanchang, China.
² School of International, East China Jiaotong University, Nanchang, China.
³ School of Information Engineering, East China Jiaotong University, Nanchang, China.
⁴ Medical School, Nanchang University, Nanchang, China.

Abstract

The retinal image is a crucial window for the clinical observation of cardiovascular, cerebrovascular, and other correlated diseases, and retinal vessel segmentation is of great benefit to clinical diagnosis. Recently, the convolutional neural network (CNN), especially in its U-shaped variants, has become the dominant method in the retinal vessel segmentation field. However, the conventional CNN encoder is vulnerable to noise interference, and the long-range relationships in fundus images have not been fully utilized. In this paper, we propose a novel model called Transformer in M-Net (TiM-Net), based on M-Net, diverse attention mechanisms, and weighted side output layers, to perform retinal vessel segmentation effectively. First, to alleviate the effects of noise, a dual-attention mechanism based on channel and spatial attention is designed. Then, the self-attention mechanism of the Transformer is introduced into the skip connection to re-encode features and model long-range relationships explicitly. Finally, a weighted SideOut layer is proposed to make better use of the features from each side layer. Extensive experiments are conducted on three public data sets to show the effectiveness and robustness of TiM-Net compared with state-of-the-art baselines. Both quantitative and qualitative results prove its clinical practicality. Moreover, variants of TiM-Net also achieve competitive performance, demonstrating its scalability and generalization ability. The code of our model is available at https://github.com/ZX-ECJTU/TiM-Net.
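The idea of a weighted side output can be illustrated with a minimal sketch. The abstract does not say how the side-layer weights are obtained or combined, so everything here is an assumption made for illustration: the function names `softmax` and `fuse_side_outputs` are hypothetical, the weights are treated as raw scalars normalized with a softmax, and the side outputs are taken to be same-sized vessel probability maps. It is not the authors' implementation, which is available at the repository linked above.

```python
import math


def softmax(scores):
    """Normalize raw scores into positive weights that sum to 1."""
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_side_outputs(side_maps, raw_weights):
    """Blend K same-sized probability maps into one map.

    Assumption: one scalar weight per side layer, softmax-normalized.
    """
    if len(side_maps) != len(raw_weights):
        raise ValueError("need exactly one weight per side output")
    weights = softmax(raw_weights)
    rows, cols = len(side_maps[0]), len(side_maps[0][0])
    fused = [[0.0] * cols for _ in range(rows)]
    for weight, side_map in zip(weights, side_maps):
        for r in range(rows):
            for c in range(cols):
                fused[r][c] += weight * side_map[r][c]
    return fused


# Two toy 2x2 side outputs; equal raw weights reduce to a plain average.
side_maps = [
    [[0.9, 0.1], [0.2, 0.8]],
    [[0.7, 0.3], [0.4, 0.6]],
]
fused = fuse_side_outputs(side_maps, [0.0, 0.0])
```

With equal raw weights the fusion is a simple mean of the side outputs; in a trained model the weights would presumably be learned so that more informative side layers contribute more.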

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms*
Fundus Oculi
Humans
Image Processing, Computer-Assisted* / methods
Neural Networks, Computer
Retinal Vessels / diagnostic imaging