MultiResUNet : Rethinking the U-Net architecture for multimodal biomedical image segmentation

Nabil Ibtehaz; M Sohel Rahman

doi:10.1016/j.neunet.2019.08.025

Neural Netw. 2020 Jan:121:74-87. doi: 10.1016/j.neunet.2019.08.025. Epub 2019 Sep 4.

Authors

Nabil Ibtehaz¹, M Sohel Rahman²

Affiliations

¹ Samsung R&D Institute, Bangladesh. Electronic address: n.ibtehaz@samsung.com.
² Department of CSE, BUET, ECE Building, West Palasi, Dhaka 1205, Bangladesh. Electronic address: msrahman@cse.buet.ac.bd.

PMID: 31536901
DOI: 10.1016/j.neunet.2019.08.025

Abstract

In recent years, deep learning has brought about a breakthrough in medical image segmentation, and U-Net has been the most popular architecture in the medical imaging community. Despite its outstanding overall performance in segmenting multimodal medical images, extensive experimentation on several challenging datasets shows that the classical U-Net architecture is lacking in certain aspects. We therefore propose modifications to improve upon the already state-of-the-art U-Net model and, building on them, develop a novel architecture, MultiResUNet, as a potential successor to U-Net. We tested and compared MultiResUNet with the classical U-Net on a wide range of multimodal medical images. Although the improvement is only slight for ideal images, the gains are remarkable for challenging ones. We evaluated our model on five different datasets, each with its own unique challenges, and obtained relative improvements in performance of 10.15%, 5.07%, 2.63%, 1.41%, and 0.62%, respectively. We also discuss and highlight some qualitatively superior aspects of MultiResUNet over the classical U-Net that are not reflected in the quantitative measures.
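
The relative improvements quoted in the abstract can be read as percentage gains over the baseline score. The following is a minimal sketch of that arithmetic, assuming the usual definition (proposed score minus baseline score, divided by baseline score). The abstract states neither the formula nor the metric, so the function name and the example scores are hypothetical and are not taken from the paper.

```python
def relative_improvement(baseline: float, proposed: float) -> float:
    """Percentage gain of `proposed` over `baseline`, relative to `baseline`.

    Assumed definition: (proposed - baseline) / baseline * 100.
    """
    if baseline <= 0:
        raise ValueError("baseline score must be positive")
    return (proposed - baseline) / baseline * 100.0


# Hypothetical scores, chosen only for illustration (not results from the paper).
baseline_score = 80.0   # e.g. a segmentation score for the baseline model
proposed_score = 84.0   # e.g. the same score for the proposed model

print(f"{relative_improvement(baseline_score, proposed_score):.2f}%")
```

Under this definition a relative improvement is not the same as an absolute difference in percentage points: in the illustration above, a 4-point gain on a baseline of 80 is a 5% relative improvement.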

Keywords: Convolutional neural networks; Medical imaging; Semantic segmentation; U-Net.

MeSH terms

Deep Learning*
Humans
Image Processing, Computer-Assisted / methods
Imaging, Three-Dimensional / methods*
Magnetic Resonance Imaging / methods
Microscopy, Fluorescence / methods
Neural Networks, Computer*