Few-Shot Object Detection With Self-Supervising and Cooperative Classifier

IEEE Trans Neural Netw Learn Syst. 2024 Apr;35(4):5435-5446. doi: 10.1109/TNNLS.2022.3204597. Epub 2024 Apr 4.

Abstract

Few-shot object detection (FSOD), which detects novel objects with only a few training instances, has recently attracted more attention. Previous works focus on making the most use of label information of objects. Still, they fail to consider the structural and semantic information of the image itself and solve the misclassification between data-abundant base classes and data-scarce novel classes efficiently. In this article, we propose FSOD with Self-Supervising and Cooperative Classifier ( [Formula: see text]) approach to deal with those concerns. Specifically, we analyze the underlying performance degradation of novel classes in FSOD and discover that false-positive samples are the main reason. By looking into these false-positive samples, we further notice that misclassifying novel classes as base classes are the main cause. Thus, we introduce double RoI heads into the existing Fast-RCNN to learn more specific features for novel classes. We also consider using self-supervised learning (SSL) to learn more structural and semantic information. Finally, we propose a cooperative classifier (CC) with the base-novel regularization to maximize the interclass variance between base and novel classes. In the experiment, [Formula: see text] outperforms all the latest baselines in most cases on PASCAL VOC and COCO.