Large-Scale Place Recognition Based on Camera-LiDAR Fused Descriptor

Shaorong Xie; Chao Pan; Yaxin Peng; Ke Liu; Shihui Ying

doi:10.3390/s20102870

Sensors (Basel). 2020 May 19;20(10):2870. doi: 10.3390/s20102870.

Authors

Shaorong Xie¹, Chao Pan², Yaxin Peng³, Ke Liu³, Shihui Ying³

Affiliations

¹ School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China.
² School of Mechatronic Engineering and Automation, Shanghai University, Shanghai 200444, China.
³ Department of Mathematics, School of Science, Shanghai University, Shanghai 200444, China.

Abstract

In autonomous driving, vehicles are equipped with a variety of sensors, including cameras and LiDARs. However, cameras are sensitive to illumination changes and occlusion, while LiDARs suffer from motion distortion, degenerate environments, and limited range. Fusing the information from these two sensors is therefore worth exploring. In this paper, we propose a fusion network that robustly captures both image and point cloud descriptors to solve the place recognition problem. Our contributions are: (1) applying a trimmed strategy in the point cloud global feature aggregation to improve recognition performance, (2) building a compact fusion framework that captures robust representations of both the image and the 3D point cloud, and (3) learning a proper metric to describe the similarity of the fused global features. Experiments on the KITTI and KAIST datasets show that the proposed fused descriptor is more robust and discriminative than single-sensor descriptors.
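
To make the retrieval setting concrete, the following is a minimal sketch of place recognition as nearest-neighbor search over fused global descriptors. It is not the network described in the abstract: the function names (`l2_normalize`, `fuse`, `retrieve`) are hypothetical, the fusion is a plain concatenation of normalized vectors, and the learned metric is replaced by ordinary Euclidean distance purely for illustration.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def fuse(image_desc, cloud_desc):
    """Assumed fusion: normalize each modality, concatenate, normalize again.

    The paper learns the fusion with a network; this is only a stand-in.
    """
    return l2_normalize(l2_normalize(image_desc) + l2_normalize(cloud_desc))


def retrieve(query, database):
    """Return (index, distance) of the database descriptor closest to the query.

    Euclidean distance stands in for the learned similarity metric.
    """
    best_index, best_dist = -1, math.inf
    for i, candidate in enumerate(database):
        dist = math.sqrt(sum((q - c) ** 2 for q, c in zip(query, candidate)))
        if dist < best_dist:
            best_index, best_dist = i, dist
    return best_index, best_dist


# Toy example: two previously visited places and one new observation.
database = [
    fuse([1.0, 0.0], [0.0, 1.0]),  # place 0
    fuse([0.0, 1.0], [1.0, 0.0]),  # place 1
]
query = fuse([0.9, 0.1], [0.1, 0.9])  # noisy revisit of place 0
match_index, match_dist = retrieve(query, database)
```

Normalizing each modality before concatenation keeps one sensor from dominating the distance simply because its descriptor has a larger magnitude; a learned metric, as proposed in the paper, would instead weight the modalities from data.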

Keywords: deep learning; place recognition; retrieval; sensor fusion.
