Deformable Dynamic Sampling and Dynamic Predictable Mask Mining for Image Inpainting

Cai Shang; Yu Zeng; Shu Yang; Xu Jia; Huchuan Lu; You He

doi:10.1109/TNNLS.2023.3316123

IEEE Trans Neural Netw Learn Syst. 2023 Oct 26:PP. doi: 10.1109/TNNLS.2023.3316123. Online ahead of print.

PMID: 37883252

Abstract

Existing image inpainting methods often produce artifacts for two reasons: they use vanilla convolution layers, which treat all image regions equally, as building blocks, and they generate training holes at random locations with equal probability. This design does not differentiate between missing and valid regions at inference, and it does not consider the predictability of missing regions during training. To address these issues, we propose a deformable dynamic sampling (DDS) mechanism built on deformable convolutions (DCs), together with a constraint that keeps the deformably sampled elements from falling into corrupted regions. Furthermore, to select both valid sample locations and suitable kernels dynamically, we equip DCs with content-aware dynamic kernel selection (DKS). In addition, to further encourage the DDS mechanism to find meaningful sampling locations, we propose to train the inpainting model with mined predictable regions as holes. During training, we jointly train a mask generator with the inpainting network to generate hole masks dynamically for each training sample. The mask generator can thus find large yet predictable missing regions, which serve as a better alternative to random masks. Extensive experiments demonstrate the advantages of our method over state-of-the-art methods, both qualitatively and quantitatively.
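
The abstract does not give the form of the sampling constraint, so the following is only an illustrative sketch of the general idea, not the authors' formulation. It assumes a binary validity mask (1 = valid pixel, 0 = hole) and a 3x3 kernel whose nine sampling points are shifted by learned offsets, and it scores how much of the deformed sampling pattern lands inside the hole. The names bilinear and hole_penalty, the mean-over-points form of the penalty, and the treatment of out-of-bounds reads as holes are all assumptions made for this example.

```python
import math


def bilinear(mask, y, x):
    """Bilinearly interpolate a 2D list `mask` at fractional (y, x).

    Assumption for this sketch: reads outside the mask count as 0 (hole).
    """
    h, w = len(mask), len(mask[0])
    y0, x0 = math.floor(y), math.floor(x)
    dy, dx = y - y0, x - x0
    total = 0.0
    for yy, wy in ((y0, 1.0 - dy), (y0 + 1, dy)):
        for xx, wx in ((x0, 1.0 - dx), (x0 + 1, dx)):
            if 0 <= yy < h and 0 <= xx < w:
                total += wy * wx * mask[yy][xx]
    return total


def hole_penalty(valid_mask, center, offsets):
    """Mean 'hole-ness' (1 - validity) over the nine deformed sampling points.

    `offsets` holds one hypothetical (dy, dx) pair per point of the regular
    3x3 grid around `center`, listed in row-major order.
    """
    cy, cx = center
    grid = [(i, j) for i in (-1, 0, 1) for j in (-1, 0, 1)]
    penalty = 0.0
    for (gy, gx), (oy, ox) in zip(grid, offsets):
        penalty += 1.0 - bilinear(valid_mask, cy + gy + oy, cx + gx + ox)
    return penalty / len(grid)


# Toy 5x5 mask: the two rightmost columns are missing.
mask = [[1, 1, 1, 0, 0] for _ in range(5)]
no_shift = [(0.0, 0.0)] * 9      # plain 3x3 convolution footprint
shift_left = [(0.0, -1.0)] * 9   # every sampling point moved one pixel left

print(hole_penalty(mask, (2, 2), no_shift))    # one of three columns is in the hole
print(hole_penalty(mask, (2, 2), shift_left))  # all points land on valid pixels
```

In a trainable setting a term of this kind would be differentiable with respect to the offsets, because the mask is read with bilinear interpolation, so minimizing it would push the sampling points toward valid pixels. How the paper actually weights or combines its constraint with the inpainting loss is not stated in the abstract.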