VISOR-NET: Visibility Estimation Based on Deep Ordinal Relative Learning under Discrete-Level Labels

Lina Xun; Huichao Zhang; Qing Yan; Qi Wu; Jun Zhang

doi:10.3390/s22166227

Sensors (Basel). 2022 Aug 19;22(16):6227. doi: 10.3390/s22166227.

Authors

Lina Xun¹, Huichao Zhang¹, Qing Yan¹, Qi Wu¹, Jun Zhang²

Affiliations

¹ The Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, School of Electrical Engineering and Automation, Anhui University, Hefei 230601, China.
² School of Artificial Intelligence, Anhui University, Hefei 230601, China.

Abstract

This paper proposes VISOR-NET, a novel end-to-end pipeline that uses the ordinal information and relative relations of images for visibility estimation. By encoding ordinal information into a set of relatively ordered image pairs, VISOR-NET can effectively learn a global ranking function. Because public foggy datasets lack real scenes or continuous labels, we collect a large-scale dataset of images taken from real surveillance scenes, termed Foggy Highway Visibility Images (FHVI), and synthesize an INDoor Foggy images dataset (INDF) with continuous annotation. We evaluate estimation effectiveness as a classification task on two public datasets and our FHVI dataset, and as a regression task on the INDF dataset. Comprehensive experiments against existing deep-learning methods demonstrate the performance of the proposed method in terms of estimation accuracy, convergence rate, model stability, and data requirements. Moreover, the method extends inter-level visibility estimation to intra-level visibility estimation and can realize approximate regression estimation under discrete-level labels.
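
The idea of turning discrete-level labels into relatively ordered pairs and scoring them with a single ranking function can be illustrated with a minimal Python sketch. This is not the VISOR-NET implementation: the helper names make_ordered_pairs and pairwise_logistic_loss are hypothetical, the pairwise logistic (RankNet-style) loss is an assumed stand-in for whatever objective the paper uses, and the scores are hand-written numbers rather than the output of a network.

```python
import math
from itertools import combinations


def make_ordered_pairs(levels):
    """Build (i, j) index pairs in which sample i has a strictly higher
    discrete visibility level than sample j (hypothetical helper)."""
    pairs = []
    for i, j in combinations(range(len(levels)), 2):
        if levels[i] > levels[j]:
            pairs.append((i, j))
        elif levels[j] > levels[i]:
            pairs.append((j, i))
        # Pairs with equal levels carry no ordering and are skipped.
    return pairs


def pairwise_logistic_loss(scores, pairs):
    """Mean of log(1 + exp(-(s_i - s_j))) over pairs where i should
    outrank j. Assumed loss, chosen only for illustration."""
    if not pairs:
        return 0.0
    total = 0.0
    for i, j in pairs:
        total += math.log1p(math.exp(-(scores[i] - scores[j])))
    return total / len(pairs)


# Toy discrete-level labels for four images (illustrative values only).
levels = [0, 2, 1, 2]
pairs = make_ordered_pairs(levels)

# Scores that agree with the label ordering versus scores that contradict it.
good = pairwise_logistic_loss([0.1, 2.0, 1.0, 2.5], pairs)
bad = pairwise_logistic_loss([2.0, 0.1, 1.0, 0.0], pairs)
print(len(pairs), good < bad)
```

In this sketch, a scoring function that respects the pair ordering receives a lower loss than one that contradicts it. Because the score is continuous, two images sharing the same discrete level can still receive different scores, which is one way to read the abstract's claim about intra-level estimation.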

Keywords: deep learning; ordinal regression; relative learning; visibility estimation.

MeSH terms

Deep Learning*

Grants and funding

2018YFB0504604/The National Key R&D Plan of China