Efficient three-dimensional point cloud object detection based on improved Complex-YOLO

Yongxin Shao; Zhetao Sun; Aihong Tan; Tianhong Yan

doi:10.3389/fnbot.2023.1092564

Efficient three-dimensional point cloud object detection based on improved Complex-YOLO

Front Neurorobot. 2023 Feb 16:17:1092564. doi: 10.3389/fnbot.2023.1092564. eCollection 2023.

Authors

Yongxin Shao¹, Zhetao Sun¹, Aihong Tan¹, Tianhong Yan¹

Affiliation

¹ College of Mechanical and Electrical Engineering, China Jiliang University, Hangzhou, China.

Abstract

Lidar-based 3D object detection and classification is a critical task for autonomous driving. However, inferencing from exceedingly sparse 3D data in real-time is a formidable challenge. Complex-YOLO solves the problem of point cloud disorder and sparsity by projecting it onto the bird's-eye view and realizes real-time 3D object detection based on LiDAR. However, Complex-YOLO has no object height detection, a shallow network depth, and poor small-size object detection accuracy. To address these issues, this paper has made the following improvements: (1) adds a multi-scale feature fusion network to improve the algorithm's capability to detect small-size objects; (2) uses a more advanced RepVGG as the backbone network to improve network depth and overall detection performance; and (3) adds an effective height detector to the network to improve the height detection. Through experiments, we found that our algorithm's accuracy achieved good performance on the KITTI dataset, while the detection speed and memory usage were very superior, 48FPS on RTX3070Ti and 20FPS on GTX1060, with a memory usage of 841Mib.

Keywords: 3D object detection; deep learning; neural network; object detection; point cloud.

Grants and funding

This research was funded by the Provincial Natural Science Foundation of Zhejiang, grant number Y21F010057.