GAN-Based Image Colorization for Self-Supervised Visual Feature Learning

Sensors (Basel). 2022 Feb 18;22(4):1599. doi: 10.3390/s22041599.

Abstract

Large-scale labeled datasets are generally necessary for successfully training a deep neural network in the computer vision domain. In order to avoid the costly and tedious work of manually annotating image datasets, self-supervised learning methods have been proposed to learn general visual features automatically. In this paper, we first focus on image colorization with generative adversarial networks (GANs) because of their ability to generate the most realistic colorization results. Then, via transfer learning, we use this as a proxy task for visual understanding. Particularly, we propose to use conditional GANs (cGANs) for image colorization and transfer the gained knowledge to two other downstream tasks, namely, multilabel image classification and semantic segmentation. This is the first time that GANs have been used for self-supervised feature learning through image colorization. Through extensive experiments with the COCO and Pascal datasets, we show an increase of 5% for the classification task and 2.5% for the segmentation task. This demonstrates that image colorization with conditional GANs can boost other downstream tasks' performance without the need for manual annotation.

Keywords: convolutional neural network; generative adversarial network; image colorization; self-supervised learning; transfer learning.

MeSH terms

  • Computers
  • Image Processing, Computer-Assisted* / methods
  • Neural Networks, Computer*
  • Semantics
  • Vision, Ocular