From Pixels to Precision: A Survey of Monocular Visual Odometry in Digital Twin Applications

Sensors (Basel). 2024 Feb 17;24(4):1274. doi: 10.3390/s24041274.

Abstract

This survey provides a comprehensive overview of traditional techniques and deep learning-based methodologies for monocular visual odometry (VO), with a focus on displacement measurement applications. This paper outlines the fundamental concepts and general procedures for VO implementation, including feature detection, tracking, motion estimation, triangulation, and trajectory estimation. This paper also explores the research challenges inherent in VO implementation, including scale estimation and ground plane considerations. The scientific literature is rife with diverse methodologies aiming to overcome these challenges, particularly focusing on the problem of accurate scale estimation. This issue has been typically addressed through the reliance on knowledge regarding the height of the camera from the ground plane and the evaluation of feature movements on that plane. Alternatively, some approaches have utilized additional tools, such as LiDAR or depth sensors. This survey of approaches concludes with a discussion of future research challenges and opportunities in the field of monocular visual odometry.

Keywords: deep learning; feature based; localization; machine learning; measurement; monocular; odometry; survey.

Grants and funding

This research was partially funded by the NATO Science for Peace and Security Programme, under the Multi-Year Project G5924, titled “Inspection and security by Robots interacting with Infrastructure digital twinS (IRIS)”.