Sharp U-Net: Depthwise convolutional network for biomedical image segmentation

Hasib Zunair; A Ben Hamza

doi:10.1016/j.compbiomed.2021.104699

Comput Biol Med. 2021 Sep:136:104699. doi: 10.1016/j.compbiomed.2021.104699. Epub 2021 Jul 29.

Authors

Hasib Zunair¹, A Ben Hamza²

Affiliations

¹ Concordia Institute for Information Systems Engineering, Concordia University, Montreal, QC, Canada.
² Concordia Institute for Information Systems Engineering, Concordia University, Montreal, QC, Canada. Electronic address: hamza@ciise.concordia.ca.

PMID: 34348214
DOI: 10.1016/j.compbiomed.2021.104699

Abstract

The U-Net architecture, built upon the fully convolutional network, has proven to be effective in biomedical image segmentation. However, U-Net applies skip connections to merge semantically different low- and high-level convolutional features, resulting in not only blurred feature maps, but also over- and under-segmented target regions. To address these limitations, we propose a simple yet effective end-to-end depthwise encoder-decoder fully convolutional network architecture, called Sharp U-Net, for binary and multi-class biomedical image segmentation. The key rationale of Sharp U-Net is that instead of applying a plain skip connection, a depthwise convolution of the encoder feature map with a sharpening kernel filter is employed prior to merging the encoder and decoder features, thereby producing a sharpened intermediate feature map of the same size as the encoder map. Using this sharpening filter layer, we are able not only to fuse semantically less dissimilar features, but also to smooth out artifacts throughout the network layers during the early stages of training. Our extensive experiments on six datasets show that the proposed Sharp U-Net model consistently outperforms or matches the recent state-of-the-art baselines in both binary and multi-class segmentation tasks, while adding no extra learnable parameters. Furthermore, Sharp U-Net outperforms baselines that have more than three times the number of learnable parameters.
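
The sharpened skip connection described in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation. The kernel values (a common 3x3 sharpening kernel whose weights sum to 1), the zero padding, the channel-first nested-list layout, and the names SHARPEN_KERNEL, sharpen_channel and sharp_skip are all assumptions made for illustration. The sketch only shows the idea of filtering each encoder channel independently with a fixed kernel, keeping the spatial size, and then concatenating the result with the decoder features.

```python
# Assumed 3x3 sharpening kernel (a common choice, not taken from the paper).
# It is fixed rather than learned, so it adds no trainable parameters.
SHARPEN_KERNEL = [
    [0.0, -1.0, 0.0],
    [-1.0, 5.0, -1.0],
    [0.0, -1.0, 0.0],
]


def sharpen_channel(channel, kernel=SHARPEN_KERNEL):
    """Filter one 2-D channel with zero padding so the output keeps the input size.

    The kernel is symmetric, so correlation and convolution give the same result.
    """
    height, width = len(channel), len(channel[0])
    k = len(kernel) // 2
    out = [[0.0] * width for _ in range(height)]
    for i in range(height):
        for j in range(width):
            acc = 0.0
            for di in range(-k, k + 1):
                for dj in range(-k, k + 1):
                    y, x = i + di, j + dj
                    # Positions outside the map contribute zero (zero padding).
                    if 0 <= y < height and 0 <= x < width:
                        acc += kernel[di + k][dj + k] * channel[y][x]
            out[i][j] = acc
    return out


def sharp_skip(encoder_map, decoder_map):
    """Sharpen each encoder channel independently (depthwise), then
    concatenate the sharpened channels with the decoder channels."""
    sharpened = [sharpen_channel(ch) for ch in encoder_map]
    return sharpened + decoder_map


# Toy feature maps in [channel][row][column] layout.
encoder = [
    [[1.0] * 3 for _ in range(3)],                         # constant channel
    [[0.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 0.0]],   # unit impulse
]
decoder = [[[2.0] * 3 for _ in range(3)]]
merged = sharp_skip(encoder, decoder)
```

In this toy example the interior of the constant channel is unchanged because the kernel weights sum to 1, while the impulse channel reproduces the kernel itself. In a deep learning framework the same operation would normally be expressed as a depthwise convolution layer with frozen weights.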

Keywords: Fully convolutional network; Semantic segmentation; Sharpening filter; Skip connections; U-Net.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Artifacts
Image Processing, Computer-Assisted*
Neural Networks, Computer*