Learnable Descriptors for Visual Search

Andrea Migliorati; Attilio Fiandrotti; Gianluca Francini; Riccardo Leonardi

doi:10.1109/TIP.2020.3031216

IEEE Trans Image Process. 2021;30:80-91. doi: 10.1109/TIP.2020.3031216. Epub 2020 Nov 18.

PMID: 33095712

Abstract

This work proposes LDVS, a learnable binary local descriptor devised for matching natural images within the MPEG CDVS framework. LDVS descriptors are learned so that they can be sign-quantized and compared using the Hamming distance. The underlying convolutional architecture has a moderate parameter count, which makes it suitable for operation on mobile devices. Our experiments show that LDVS descriptors perform favorably against comparable learned binary descriptors at patch matching on two different datasets. A complete pairwise image matching pipeline is then designed around LDVS descriptors by integrating them into the reference CDVS evaluation framework. Experiments show that LDVS descriptors outperform the compressed SIFT-like CDVS descriptors at pairwise image matching on the challenging CDVS image dataset.
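
As an illustration of what "sign-quantized and compared using the Hamming distance" means in practice, the following is a minimal sketch in plain Python. It is not the authors' implementation: the function names `sign_quantize` and `hamming_distance`, the 8-dimensional toy vectors, and the convention that a value of exactly zero maps to bit 0 are all assumptions made for this example. The real-valued inputs stand in for the output of a learned network, which is not modeled here.

```python
def sign_quantize(descriptor):
    """Pack a real-valued descriptor into an int, one bit per component.

    Assumed convention: a strictly positive component gives bit 1,
    anything else (including exactly 0.0) gives bit 0.
    """
    bits = 0
    for value in descriptor:
        bits = (bits << 1) | (1 if value > 0 else 0)
    return bits


def hamming_distance(a, b):
    """Count the bit positions in which two packed descriptors differ."""
    return bin(a ^ b).count("1")


# Toy 8-dimensional descriptors (hypothetical values, not from the paper).
desc_a = [0.7, -1.2, 0.1, -0.3, 2.0, -0.5, 0.0, 0.9]
desc_b = [0.4, 0.8, -0.6, -0.1, 1.5, 0.2, -0.9, 0.3]

code_a = sign_quantize(desc_a)  # 0b10101001
code_b = sign_quantize(desc_b)  # 0b11001101

print(hamming_distance(code_a, code_b))  # 3
```

Packing the bits into an integer means the comparison reduces to one XOR followed by a bit count, which is the reason binary descriptors are attractive on devices with limited compute.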