FSNet: Focus Scanning Network for Camouflaged Object Detection

Ze Song; Xudong Kang; Xiaohui Wei; Haibo Liu; Renwei Dian; Shutao Li

doi:10.1109/TIP.2023.3266659

IEEE Trans Image Process. 2023;32:2267-2278. doi: 10.1109/TIP.2023.3266659. Epub 2023 Apr 21.

PMID: 37067971

Abstract

Camouflaged object detection (COD) aims to discover objects that blend in with the background because of similar colors or textures. Existing deep learning methods do not systematically lay out the key tasks in COD, which seriously hinders improvement in performance. In this paper, we introduce the concept of focus areas, which are regions containing discernible colors or textures, and develop a two-stage focus scanning network for camouflaged object detection. Specifically, a novel encoder-decoder module is first designed to determine a region where the focus areas may appear. In this process, a multi-layer Swin transformer is deployed to encode global context information between the object and the background, and a novel cross-connection decoder is proposed to fuse cross-layer textures and semantics. Then, we use multi-scale dilated convolution to obtain discriminative features at different scales in the focus areas. Meanwhile, a dynamic difficulty-aware loss is designed to guide the network to pay more attention to structural details. Extensive experimental results on the CAMO, CHAMELEON, COD10K, and NC4K benchmarks show that the proposed method performs favorably against other state-of-the-art methods.
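
The abstract does not give the layer configuration of the multi-scale dilated convolution, so the following is only a minimal 1D sketch of the general idea, not the authors' implementation. The function names, the 3-tap kernel, and the dilation rates (1, 2, 4) are illustrative assumptions. It shows how the same kernel, applied at several dilation rates, samples progressively wider neighborhoods without adding weights.

```python
def dilated_conv1d(signal, kernel, dilation):
    """Zero-padded, same-length 1D dilated convolution (cross-correlation form).

    Kernel tap j reads the input at offset (j - center) * dilation, so a larger
    dilation spreads the same taps over a wider neighborhood.
    """
    center = len(kernel) // 2
    out = []
    for i in range(len(signal)):
        acc = 0.0
        for j, w in enumerate(kernel):
            idx = i + (j - center) * dilation
            if 0 <= idx < len(signal):  # positions outside the signal count as zero
                acc += w * signal[idx]
        out.append(acc)
    return out


def receptive_field(kernel_size, dilation):
    """Span of input positions covered by one dilated kernel."""
    return dilation * (kernel_size - 1) + 1


def multi_scale_features(signal, kernel, dilations=(1, 2, 4)):
    """Apply the same kernel at several dilation rates (rates are assumed here).

    Returns one feature sequence per rate; a real network would typically
    concatenate or sum such branches and learn a separate kernel for each.
    """
    return [dilated_conv1d(signal, kernel, d) for d in dilations]
```

With a 3-tap kernel, `receptive_field` gives spans of 3, 5, and 9 input positions for dilation rates 1, 2, and 4, which is the sense in which the branches see features at different scales.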