MCEENet: Multi-Scale Context Enhancement and Edge-Assisted Network for Few-Shot Semantic Segmentation

Sensors (Basel). 2023 Mar 8;23(6):2922. doi: 10.3390/s23062922.

Abstract

Few-shot semantic segmentation has attracted much attention because it requires only a few labeled samples to achieve good segmentation performance. However, existing methods still suffer from insufficient contextual information and unsatisfactory edge segmentation results. To overcome these two issues, this paper proposes a multi-scale context enhancement and edge-assisted network (called MCEENet) for few-shot semantic segmentation. First, rich support and query image features are extracted by two weight-shared feature extraction networks, each consisting of a ResNet and a Vision Transformer. A multi-scale context enhancement (MCE) module is then proposed to fuse the ResNet and Vision Transformer features and to further mine the contextual information of the image through cross-scale feature fusion and multi-scale dilated convolutions. Furthermore, we design an Edge-Assisted Segmentation (EAS) module, which fuses the shallow ResNet features of the query image with the edge features computed by the Sobel operator to assist the final segmentation task. Experiments on the PASCAL-5i dataset demonstrate the effectiveness of MCEENet: it achieves 63.5% under the 1-shot setting and 64.7% under the 5-shot setting, surpassing the state-of-the-art results by 1.4% and 0.6%, respectively.

Keywords: edge-assisted segmentation; few-shot semantic segmentation; multi-scale context enhancement.
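The EAS module relies on edge features computed by the Sobel operator. As a minimal sketch of that step only (the fusion with shallow ResNet features is not shown, and the function and kernel names below are illustrative, not from the paper), the gradient-magnitude edge map of a grayscale image can be computed as follows:

```python
import numpy as np

# Standard 3x3 Sobel kernels for horizontal and vertical gradients.
SOBEL_X = np.array([[-1, 0, 1],
                    [-2, 0, 2],
                    [-1, 0, 1]], dtype=np.float32)
SOBEL_Y = SOBEL_X.T

def conv2d_same(img: np.ndarray, kernel: np.ndarray) -> np.ndarray:
    """Naive 'same'-size 2-D cross-correlation with zero padding."""
    h, w = img.shape
    kh, kw = kernel.shape
    pad = kh // 2
    padded = np.pad(img, pad)
    out = np.zeros((h, w), dtype=np.float32)
    for i in range(h):
        for j in range(w):
            out[i, j] = np.sum(padded[i:i + kh, j:j + kw] * kernel)
    return out

def sobel_edges(img: np.ndarray) -> np.ndarray:
    """Gradient-magnitude edge map: sqrt(Gx^2 + Gy^2)."""
    gx = conv2d_same(img, SOBEL_X)
    gy = conv2d_same(img, SOBEL_Y)
    return np.sqrt(gx ** 2 + gy ** 2)

if __name__ == "__main__":
    # A vertical step edge: zeros on the left, ones on the right.
    img = np.zeros((5, 5), dtype=np.float32)
    img[:, 3:] = 1.0
    edges = sobel_edges(img)
    # The response is strong near the step and zero in the flat left region.
    print(edges[2, 2], edges[2, 0])
```

In practice such an edge map would be stacked or fused with the shallow feature maps as an extra channel; libraries such as OpenCV (`cv2.Sobel`) provide an optimized equivalent of this computation.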