Single-Image Depth Inference Using Generative Adversarial Networks

Daniel Stanley Tan; Chih-Yuan Yao; Conrado Ruiz Jr; Kai-Lung Hua

doi:10.3390/s19071708

Single-Image Depth Inference Using Generative Adversarial Networks

Sensors (Basel). 2019 Apr 10;19(7):1708. doi: 10.3390/s19071708.

Authors

Daniel Stanley Tan¹, Chih-Yuan Yao², Conrado Ruiz Jr³, Kai-Lung Hua^{4

5}

Affiliations

¹ Department of Computer Science and Information Engineering, National Taiwan University of Science and Technology, Taipei 10607, Taiwan. D10515805@mail.ntust.edu.tw.
² Department of Computer Science and Information Engineering, National Taiwan University of Science and Technology, Taipei 10607, Taiwan. cyuan.yao@csie.ntust.edu.tw.
³ Software Technology Department, De La Salle University, Manila 1004, Philippines. conrado.ruiz@dlsu.edu.ph.
⁴ Department of Computer Science and Information Engineering, National Taiwan University of Science and Technology, Taipei 10607, Taiwan. hua@mail.ntust.edu.tw.
⁵ Center for Cyber-Physical System Innovation, National Taiwan University of Science and Technology, Taipei 10607, Taiwan. hua@mail.ntust.edu.tw.

Abstract

Depth has been a valuable piece of information for perception tasks such as robot grasping, obstacle avoidance, and navigation, which are essential tasks for developing smart homes and smart cities. However, not all applications have the luxury of using depth sensors or multiple cameras to obtain depth information. In this paper, we tackle the problem of estimating the per-pixel depths from a single image. Inspired by the recent works on generative neural network models, we formulate the task of depth estimation as a generative task where we synthesize an image of the depth map from a single Red, Green, and Blue (RGB) input image. We propose a novel generative adversarial network that has an encoder-decoder type generator with residual transposed convolution blocks trained with an adversarial loss. Quantitative and qualitative experimental results demonstrate the effectiveness of our approach over several depth estimation works.

Keywords: depth estimation; encoder-decoder networks; generative adversarial networks.

Abstract

Grants and funding