Multiple-Attention Mechanism Network for Semantic Segmentation

Sensors (Basel). 2022 Jun 13;22(12):4477. doi: 10.3390/s22124477.

Abstract

Contextual information and the dependencies between dimensions is vital in image semantic segmentation. In this paper, we propose a multiple-attention mechanism network (MANet) for semantic segmentation in a very effective and efficient way. Concretely, the contributions are as follows: (1) a novel dual-attention mechanism for capturing feature dependencies in spatial and channel dimensions, where the adjacent position attention captures the dependencies between pixels well; (2) a new cross-dimensional interactive attention feature fusion module, which strengthens the fusion of fine location structure information in low-level features and category semantic information in high-level features. We conduct extensive experiments on semantic segmentation benchmarks including PASCAL VOC 2012 and Cityscapes datasets. Our MANet achieves the mIoU scores of 75.5% and 72.8% on PASCAL VOC 2012 and Cityscapes datasets, respectively. The effectiveness of the network is higher than the previous popular semantic segmentation networks under the same conditions.

Keywords: adjacent position attention; attention mechanism; cross-dimensional interactive; semantic segmentation.

MeSH terms

  • Image Processing, Computer-Assisted / methods
  • Neural Networks, Computer*
  • Semantics
  • Volatile Organic Compounds*

Substances

  • Volatile Organic Compounds