Monocular camera and laser based semantic mapping system with temporal-spatial data association for indoor mobile robots

Xu Song; Zuo Zhijiang; Xuan Liang; Zhou Huaidong

doi:10.1007/s11042-023-14796-1

Monocular camera and laser based semantic mapping system with temporal-spatial data association for indoor mobile robots

Multimed Tools Appl. 2023 Mar 7:1-26. doi: 10.1007/s11042-023-14796-1. Online ahead of print.

Authors

Xu Song¹, Zuo Zhijiang¹, Xuan Liang¹, Zhou Huaidong²

Affiliations

¹ School of Smart Manufacturing, Jianghan University, Wuhan, 430056 China.
² School of Mechanical Engineering & Automation, Beihang University, Beijing, 100191 China.

Abstract

In the future, the goal of service robots is to operate in human-centric indoor environments, requiring close cooperation with humans. In order to enable the robot to perform various interactive tasks, it is necessary for robots to perceive and understand environments from a human perspective. Semantic map is an augmented representation of the environment, containing both geometric information and high-level qualitative features. It can help the robot to comprehensively understand the environment and bridge the gap in human-robot interaction. In this paper, we propose a unified semantic mapping system for indoor mobile robots. This system utilizes the techniques of scene classification and object detection to construct semantic representations of indoor environments by fusing the data of a camera and a laser. In order to improve the accuracy of semantic mapping, the temporal-spatial correlation of semantics is leveraged to realize data association of semantic maps. Also, the proposed semantic mapping system is scalable and portable, which can be applied to different indoor scenarios. The proposed system was evaluated with collected datasets captured in indoor environments. Extensive experimental results indicate that the proposed semantic mapping system exhibits great performance in the robustness and accuracy of semantic mapping.

Keywords: Human-robot interaction; Semantic mapping; Temporal-spatial correlation.

© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.