Medical image segmentation model based on triple gate MultiLayer perceptron

Jingke Yan; Xin Wang; Jingye Cai; Qin Qin; Hao Yang; Qin Wang; Yao Cheng; Tian Gan; Hua Jiang; Jianhua Deng; Bingxu Chen

doi:10.1038/s41598-022-09452-x

Medical image segmentation model based on triple gate MultiLayer perceptron

Sci Rep. 2022 Apr 12;12(1):6103. doi: 10.1038/s41598-022-09452-x.

Authors

Jingke Yan¹, Xin Wang^#^{2

3

4}, Jingye Cai⁵, Qin Qin^#⁶, Hao Yang⁷, Qin Wang^#⁸, Yao Cheng⁹, Tian Gan^#¹⁰, Hua Jiang¹⁰, Jianhua Deng⁵, Bingxu Chen¹

Affiliations

¹ Guilin University of Electronic Technology, School of Marine Engineering, Beihai, 536000, China.
² Guilin University of Electronic Technology, School of Marine Engineering, Beihai, 536000, China. 304379506@qq.com.
³ University of Electronic Science and Technology of China,School of Information and Software Engineering, Chengdu, 610000, China. 304379506@qq.com.
⁴ Guilin University of Electronic Technology, School of Computer Science and Information Security, Guilin, 541004, China. 304379506@qq.com.
⁵ University of Electronic Science and Technology of China,School of Information and Software Engineering, Chengdu, 610000, China.
⁶ Guilin University of Electronic Technology, School of Marine Engineering, Beihai, 536000, China. qinqin@guet.edu.cn.
⁷ China Academy of Engineering Physics, Institute of Applied Electronics, Mianyang, 621900, China.
⁸ Basic Teaching Department, Guilin University of Electronic Technology, Beihai, 536000, China. 283252764@qq.com.
⁹ Southwest Jiaotong University, State Key Laboratory of Traction Power, Chengdu, 610000, China.
¹⁰ Guilin University of Electronic Technology, School of Computer Science and Information Security, Guilin, 541004, China.

^# Contributed equally.

Abstract

To alleviate the social contradiction between limited medical resources and increasing medical needs, the medical image-assisted diagnosis based on deep learning has become the research focus in Wise Information Technology of med. Most of the existing medical segmentation models based on Convolution or Transformer have achieved relatively sound effects. However, the Convolution-based model with a limited receptive field cannot establish long-distance dependencies between features as the Network deepens. The Transformer-based model produces large computation overhead and cannot generalize the bias of local features and perceive the position feature of medical images, which are essential in medical image segmentation. To address those issues, we present Triple Gate MultiLayer Perceptron U-Net (TGMLP U-Net), a medical image segmentation model based on MLP, in which we design the Triple Gate MultiLayer Perceptron (TGMLP), composed of three parts. Firstly, considering encoding the position information of features, we propose the Triple MLP module based on MultiLayer Perceptron in this model. It uses linear projection to encode features from the high, wide, and channel dimensions, enabling the model to capture the long-distance dependence of features along the spatial dimension and the precise position information of features in three dimensions with less computational overhead. Then, we design the Local Priors and Global Perceptron module. The Global Perceptron divides the feature map into different partitions and conducts correlation modelling for each partition to establish the global dependency between partitions. The Local Priors uses multi-scale Convolution with high local feature extraction ability to explore further the relationship of context feature information within the structure. At last, we suggest a Gate-controlled Mechanism to effectively solves the problem that the dependence of position embeddings between Patches and within Patches in medical images cannot be well learned due to the relatively small number of samples in medical images segmentation data. Experimental results indicate that the proposed model outperforms other state-of-the-art models in most evaluation indicators, demonstrating its excellent performance in segmenting medical images.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Electric Power Supplies
Image Processing, Computer-Assisted* / methods
Neural Networks, Computer*
Sound