MRI brain tumor segmentation using residual Spatial Pyramid Pooling-powered 3D U-Net

Sanchit Vijay; Thejineaswar Guhan; Kathiravan Srinivasan; P M Durai Raj Vincent; Chuan-Yu Chang

doi:10.3389/fpubh.2023.1091850

MRI brain tumor segmentation using residual Spatial Pyramid Pooling-powered 3D U-Net

Front Public Health. 2023 Feb 2:11:1091850. doi: 10.3389/fpubh.2023.1091850. eCollection 2023.

Authors

Sanchit Vijay¹, Thejineaswar Guhan², Kathiravan Srinivasan³, P M Durai Raj Vincent², Chuan-Yu Chang^{4

5}

Affiliations

¹ School of Electronics Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India.
² School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India.
³ School of Computer Science and Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India.
⁴ Department of Computer Science and Information Engineering, National Yunlin University of Science and Technology, Yunlin, Taiwan.
⁵ Service Systems Technology Center, Industrial Technology Research Institute, Hsinchu, Taiwan.

Abstract

Brain tumor diagnosis has been a lengthy process, and automation of a process such as brain tumor segmentation speeds up the timeline. U-Nets have been a commonly used solution for semantic segmentation, and it uses a downsampling-upsampling approach to segment tumors. U-Nets rely on residual connections to pass information during upsampling; however, an upsampling block only receives information from one downsampling block. This restricts the context and scope of an upsampling block. In this paper, we propose SPP-U-Net where the residual connections are replaced with a combination of Spatial Pyramid Pooling (SPP) and Attention blocks. Here, SPP provides information from various downsampling blocks, which will increase the scope of reconstruction while attention provides the necessary context by incorporating local characteristics with their corresponding global dependencies. Existing literature uses heavy approaches such as the usage of nested and dense skip connections and transformers. These approaches increase the training parameters within the model which therefore increase the training time and complexity of the model. The proposed approach on the other hand attains comparable results to existing literature without changing the number of trainable parameters over larger dimensions such as 160 × 192 × 192. All in all, the proposed model scores an average dice score of 0.883 and a Hausdorff distance of 7.84 on Brats 2021 cross validation.

Keywords: 3D U-Net; Spatial Pyramid Pooling; brain tumor segmentation; healthcare; image processing.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Brain
Brain Neoplasms* / pathology
Humans
Image Processing, Computer-Assisted / methods
Magnetic Resonance Imaging / methods
Neural Networks, Computer*

Grants and funding

This research was partially funded by Intelligent Recognition Industry Service Research Center from the Featured Areas Research Center Program within the framework of the Higher Education Sprout Project by the Ministry of Education (MOE) in Taiwan and Ministry of Science and Technology in Taiwan (Grant No. MOST 109-2221-E-224-048-MY2).