Leveraging Deep Learning for Visual Odometry Using Optical Flow

Sensors (Basel). 2021 Feb 12;21(4):1313. doi: 10.3390/s21041313.

Abstract

In this paper, we study deep learning approaches for monocular visual odometry (VO). Deep learning solutions have been shown to be effective in VO applications, removing the need for highly engineered steps, such as feature extraction and outlier rejection, in a traditional pipeline. We propose a new architecture combining ego-motion estimation and sequence-based learning using deep neural networks. We estimate camera motion from optical flow using Convolutional Neural Networks (CNNs) and model the motion dynamics using Recurrent Neural Networks (RNNs). The network outputs the relative 6-DOF camera poses for a sequence and implicitly learns the absolute scale without requiring camera intrinsics. The entire trajectory is then integrated without any post-calibration. We evaluate the proposed method on the KITTI dataset and compare it with traditional methods and other deep learning approaches from the literature.

Keywords: deep learning; ego-motion estimation; visual odometry.
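
The abstract describes a CNN-plus-RNN pipeline that regresses relative 6-DOF poses from optical flow. The sketch below is only an illustration of that general idea, not the authors' exact architecture: the class name `FlowVONet`, the layer sizes, and the Euler-angle pose parameterization are assumptions introduced for this example.

```python
# Minimal sketch (assumed architecture, not the paper's): a small CNN encodes
# each 2-channel optical-flow field, an LSTM models motion dynamics over the
# sequence, and a linear head regresses a relative 6-DOF pose per frame pair
# (3 translation + 3 rotation parameters).
import torch
import torch.nn as nn

class FlowVONet(nn.Module):
    def __init__(self, hidden_size=256):
        super().__init__()
        # CNN over the (u, v) optical-flow channels.
        self.encoder = nn.Sequential(
            nn.Conv2d(2, 16, kernel_size=7, stride=2, padding=3), nn.ReLU(),
            nn.Conv2d(16, 32, kernel_size=5, stride=2, padding=2), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),  # one 64-d feature per frame pair
        )
        # LSTM captures temporal dependencies across consecutive flow features.
        self.rnn = nn.LSTM(input_size=64, hidden_size=hidden_size,
                           num_layers=2, batch_first=True)
        # Regress the relative 6-DOF pose for each frame pair.
        self.head = nn.Linear(hidden_size, 6)

    def forward(self, flow_seq):
        # flow_seq: (B, T, 2, H, W) -- optical flow between consecutive frames.
        b, t, c, h, w = flow_seq.shape
        feats = self.encoder(flow_seq.view(b * t, c, h, w)).view(b, t, -1)
        out, _ = self.rnn(feats)   # (B, T, hidden_size)
        return self.head(out)      # (B, T, 6) relative poses

# Usage on random data shaped like downsampled KITTI flow fields.
model = FlowVONet()
flows = torch.randn(1, 5, 2, 96, 320)  # 1 sequence, 5 frame pairs
rel_poses = model(flows)               # (1, 5, 6)
print(rel_poses.shape)
```

In such a setup, the full trajectory would be recovered by composing the predicted relative poses frame by frame, which matches the abstract's statement that the trajectory is integrated without post-calibration.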