Attention-Based Context Aware Network for Semantic Comprehension of Aerial Scenery

Weipeng Shi; Wenhu Qin; Zhonghua Yun; Peng Ping; Kaiyang Wu; Yuke Qu

doi:10.3390/s21061983

Attention-Based Context Aware Network for Semantic Comprehension of Aerial Scenery

Sensors (Basel). 2021 Mar 11;21(6):1983. doi: 10.3390/s21061983.

Authors

Weipeng Shi¹, Wenhu Qin¹, Zhonghua Yun¹, Peng Ping¹, Kaiyang Wu¹, Yuke Qu¹

Affiliation

¹ School of Instrument Science and Engineering, Southeast University, Nanjing 210096, China.

Abstract

It is essential for researchers to have a proper interpretation of remote sensing images (RSIs) and precise semantic labeling of their component parts. Although FCN (Fully Convolutional Networks)-like deep convolutional network architectures have been widely applied in the perception of autonomous cars, there are still two challenges in the semantic segmentation of RSIs. The first is to identify details in high-resolution images with complex scenes and to solve the class-mismatch issues; the second is to capture the edge of objects finely without being confused by the surroundings. HRNET has the characteristics of maintaining high-resolution representation by fusing feature information with parallel multi-resolution convolution branches. We adopt HRNET as a backbone and propose to incorporate the Class-Oriented Region Attention Module (CRAM) and Class-Oriented Context Fusion Module (CCFM) to analyze the relationships between classes and patch regions and between classes and local or global pixels, respectively. Thus, the perception capability of the model for the detailed part in the aerial image can be enhanced. We leverage these modules to develop an end-to-end semantic segmentation model for aerial images and validate it on the ISPRS Potsdam and Vaihingen datasets. The experimental results show that our model improves the baseline accuracy and outperforms some commonly used CNN architectures.

Keywords: computer vision; convolutional neural network; deep learning; pattern recognition; remote sensing; self-attention; semantic segmentation.

Abstract

Grants and funding