PSANet: Pyramid Splitting and Aggregation Network for 3D Object Detection in Point Cloud

Fangyu Li; Weizheng Jin; Cien Fan; Lian Zou; Qingsheng Chen; Xiaopeng Li; Hao Jiang; Yifeng Liu

doi:10.3390/s21010136

PSANet: Pyramid Splitting and Aggregation Network for 3D Object Detection in Point Cloud

Sensors (Basel). 2020 Dec 28;21(1):136. doi: 10.3390/s21010136.

Authors

Fangyu Li¹, Weizheng Jin¹, Cien Fan¹, Lian Zou¹, Qingsheng Chen¹, Xiaopeng Li¹, Hao Jiang¹, Yifeng Liu²

Affiliations

¹ School of Electronic Information, Wuhan University, Wuhan 430072, China.
² National Engineering Laboratory for Risk Perception and Prevention (NEL-RPP), Beijing 100041, China.

Abstract

3D object detection in LiDAR point clouds has been extensively used in autonomous driving, intelligent robotics, and augmented reality. Although the one-stage 3D detector has satisfactory training and inference speed, there are still some performance problems due to insufficient utilization of bird's eye view (BEV) information. In this paper, a new backbone network is proposed to complete the cross-layer fusion of multi-scale BEV feature maps, which makes full use of various information for detection. Specifically, our proposed backbone network can be divided into a coarse branch and a fine branch. In the coarse branch, we use the pyramidal feature hierarchy (PFH) to generate multi-scale BEV feature maps, which retain the advantages of different levels and serves as the input of the fine branch. In the fine branch, our proposed pyramid splitting and aggregation (PSA) module deeply integrates different levels of multi-scale feature maps, thereby improving the expressive ability of the final features. Extensive experiments on the challenging KITTI-3D benchmark show that our method has better performance in both 3D and BEV object detection compared with some previous state-of-the-art methods. Experimental results with average precision (AP) prove the effectiveness of our network.

Keywords: 3D object detection; LiDAR; autonomous driving; convolutional neural networks; voxel.

Abstract

Grants and funding