Lightweight semantic segmentation network with configurable context and small object attention

Chunyu Zhang; Fang Xu; Chengdong Wu; Jinzhao Li

doi:10.3389/fncom.2023.1280640

Front Comput Neurosci. 2023 Oct 23:17:1280640. doi: 10.3389/fncom.2023.1280640. eCollection 2023.

Authors

Chunyu Zhang¹, Fang Xu², Chengdong Wu¹, Jinzhao Li³

Affiliations

¹ Faculty of Robot Science and Engineering, Northeastern University, Shenyang, China.
² Shenyang Siasun Robot & Automation Company Ltd., Shenyang, China.
³ Changchun Institute of Optics, Fine Mechanics and Physics, University of Chinese Academy of Sciences, Beijing, China.

Abstract

Current semantic segmentation algorithms suffer from distortion of encoded features and loss of small-object features. Exchanging context information can effectively mitigate feature distortion, but existing approaches operate over a fixed spatial range. Maintaining the input feature resolution reduces the loss of small-object information but slows the network down. To tackle these problems, we propose a lightweight semantic segmentation network with configurable context and small object attention (CCSONet). CCSONet comprises a long-short distance configurable context feature enhancement module (LSCFEM) and a small object attention decoding module (SOADM). Unlike a regular context exchange module, the LSCFEM configures both long-range and short-range relevant features for the current feature, providing a broader and more flexible spatial range. The SOADM enhances the features of small objects by establishing correlations among objects of the same category, avoiding the redundancy that high-resolution features would introduce. On the Cityscapes and CamVid datasets, our network achieves 76.9% and 73.1% mIoU, respectively, while running at 87 FPS and 138 FPS. It outperforms other lightweight semantic segmentation algorithms in terms of accuracy.
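The accuracy figures above are reported as mIoU (mean intersection over union). The abstract does not describe the evaluation code, so the following is only a minimal sketch of the standard definition; the function name `mean_iou` and its parameters are illustrative assumptions, not part of CCSONet.

```python
def mean_iou(pred, target, num_classes, ignore_index=None):
    """Mean intersection-over-union over two flat sequences of class labels.

    Hypothetical helper for illustration; not taken from the paper.
    """
    # conf[t][p] counts pixels whose true class is t and predicted class is p.
    conf = [[0] * num_classes for _ in range(num_classes)]
    for p, t in zip(pred, target):
        if t == ignore_index:
            continue  # skip unlabeled pixels
        conf[t][p] += 1

    ious = []
    for c in range(num_classes):
        tp = conf[c][c]
        fp = sum(conf[t][c] for t in range(num_classes)) - tp
        fn = sum(conf[c]) - tp
        denom = tp + fp + fn
        if denom > 0:  # classes absent from both prediction and target are skipped
            ious.append(tp / denom)
    return sum(ious) / len(ious) if ious else 0.0


if __name__ == "__main__":
    # Class 0: IoU = 1/2, class 1: IoU = 2/3, so the mean is 7/12.
    print(mean_iou([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2))
```

In practice, benchmark evaluations usually accumulate the confusion counts over the whole test set before computing per-class IoU, rather than averaging per-image scores.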

Keywords: context feature enhancement; encoder-decoder; lightweight network; semantic segmentation; small object attention.

Grants and funding

The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.