Multiple-Attention Mechanism Network for Semantic Segmentation

Dongli Wang; Shengliang Xiang; Yan Zhou; Jinzhen Mu; Haibin Zhou; Richard Irampaye

doi:10.3390/s22124477

Multiple-Attention Mechanism Network for Semantic Segmentation

Sensors (Basel). 2022 Jun 13;22(12):4477. doi: 10.3390/s22124477.

Authors

Dongli Wang¹, Shengliang Xiang¹, Yan Zhou¹, Jinzhen Mu², Haibin Zhou³, Richard Irampaye³

Affiliations

¹ School of Automation and Electronic Information, Xiangtan University, Xiangtan 411105, China.
² Shanghai Aerospace Control Technology Institute, Shanghai 201109, China.
³ School of Mathematics and Computational Science, Xiangtan University, Xiangtan 411105, China.

Abstract

Contextual information and the dependencies between dimensions is vital in image semantic segmentation. In this paper, we propose a multiple-attention mechanism network (MANet) for semantic segmentation in a very effective and efficient way. Concretely, the contributions are as follows: (1) a novel dual-attention mechanism for capturing feature dependencies in spatial and channel dimensions, where the adjacent position attention captures the dependencies between pixels well; (2) a new cross-dimensional interactive attention feature fusion module, which strengthens the fusion of fine location structure information in low-level features and category semantic information in high-level features. We conduct extensive experiments on semantic segmentation benchmarks including PASCAL VOC 2012 and Cityscapes datasets. Our MANet achieves the mIoU scores of 75.5% and 72.8% on PASCAL VOC 2012 and Cityscapes datasets, respectively. The effectiveness of the network is higher than the previous popular semantic segmentation networks under the same conditions.

Keywords: adjacent position attention; attention mechanism; cross-dimensional interactive; semantic segmentation.

MeSH terms

Image Processing, Computer-Assisted / methods
Neural Networks, Computer*
Semantics
Volatile Organic Compounds*

Substances

Volatile Organic Compounds

Grants and funding

61773330/National Natural Science Foundation of China