LD-UNet: A long-distance perceptual model for segmentation of blurred boundaries in medical images

Comput Biol Med. 2024 Mar:171:108120. doi: 10.1016/j.compbiomed.2024.108120. Epub 2024 Feb 6.

Abstract

The blurriness of boundaries in medical image target regions hinders further improvement in automatic segmentation accuracy and is a challenging problem. To address this issue, we propose a model called long-distance perceptual UNet (LD-UNet), which has a powerful long-‍distance perception ability and can effectively perceive the semantic context of an entire image. Specifically, LD-UNet utilizes global and local long-distance induction modules, which endow the model with contextual semantic induction capabilities for long-distance feature dependencies. The modules perform long-distance semantic perception at the high and low stages of LD-UNet, respectively, effectively improving the accuracy of local blurred information assessment. We also propose a top-down deep supervision method to enhance the ability of the model to fit data. Then, extensive experiments on four types of tumor data with blurred boundaries are conducted. The dataset includes nasopharyngeal carcinoma, esophageal carcinoma, pancreatic carcinoma, and colorectal carcinoma. The dice similarity coefficient scores obtained by LD-UNet on the four datasets are 73.35%, 85.93%, 70.04%, and 82.71%. Experimental results demonstrate that LD-UNet is more effective in improving the segmentation accuracy of blurred boundary regions than other methods with long-distance perception, such as transformers. Among all models, LD-UNet achieves the best performance. By visualizing the feature dependency field of the models, we further explore the advantages of LD-UNet in segmenting blurred boundaries.

Keywords: Blurred boundary segmentation; Deep learning; Long-distance perceptual; Medical image segmentation.

MeSH terms

  • Colorectal Neoplasms*
  • Esophageal Neoplasms*
  • Humans
  • Image Processing, Computer-Assisted
  • Pancreatic Neoplasms*
  • Semantics