Real-time airplane detection using multi-dimensional attention and feature fusion

PeerJ Comput Sci. 2023 Apr 3:9:e1331. doi: 10.7717/peerj-cs.1331. eCollection 2023.

Abstract

The remote sensing image airplane object detection tasks remain a challenge such as missed detection and misdetection, and that is due to the low resolution occupied by airplane objects and large background noise. To address the problems above, we propose an AE-YOLO (Accurate and Efficient Yolov4-tiny) algorithm and thus obtain higher detection precision for airplane detection in remote sensing images. A multi-dimensional channel and spatial attention module is designed to filter out background noise information, and we also adopt a local cross-channel interaction strategy without dimensionality reduction so as to reduce the loss of local information caused by the scaling of the fully connected layer. The weighted two-way feature pyramid operation is used to fuse features and the correlation between different channels is learned to improve the utilization of features. A lightweight convolution module is exploited to reconstruct the network, which effectively reduce the parameters and computations while improving the accuracy of the detection model. Extensive experiments validate that the proposed algorithm is more lightweight and efficient for airplane detection. Moreover, experimental results on the airplane dataset show that the proposed algorithm meets real-time requirements, and its detection accuracy is 7.76% higher than the original algorithm.

Keywords: Airplane detection; Attention module; Feature fusion; Lightweight; Remote sense image.

Grants and funding

The work was supported by the Science and Technology Research and Development Plan Project of Handan, Hebei Province, China (21422031289) and the Ministry of Education University-Industry Collaborative Education Program, China (220601828023121). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.