Progressive Modality Cooperation for Multi-Modality Domain Adaptation

IEEE Trans Image Process. 2021;30:3293-3306. doi: 10.1109/TIP.2021.3052083. Epub 2021 Mar 3.

Abstract

In this work, we propose a new generic multi-modality domain adaptation framework called Progressive Modality Cooperation (PMC) to transfer the knowledge learned from the source domain to the target domain by exploiting multiple modality cues (e.g., RGB and depth) under both the multi-modality domain adaptation (MMDA) setting and the more general multi-modality domain adaptation using privileged information (MMDA-PI) setting. Under the MMDA setting, the samples in both domains have all the modalities. Through effective collaboration among the modalities, the two newly proposed modules in PMC select reliable pseudo-labeled target samples, capturing modality-specific information and modality-integrated information, respectively. Under the MMDA-PI setting, some modalities are missing in the target domain. To better exploit the multi-modality data in the source domain, we therefore propose PMC with privileged information (PMC-PI), which introduces a new multi-modality data generation (MMG) network. MMG generates the missing modalities in the target domain based on the source domain data while accounting for both domain distribution mismatch and semantics preservation, which are achieved by adversarial learning and by conditioning on weighted pseudo semantic class labels, respectively. Extensive experiments on three image datasets and eight video datasets, covering various multi-modality cross-domain visual recognition tasks under both the MMDA and MMDA-PI settings, clearly demonstrate the effectiveness of the proposed PMC framework.
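To make the modality-cooperation idea concrete, the following is a minimal PyTorch sketch of reliable pseudo-label selection, assuming one classifier per modality (RGB and depth) plus an averaged fused prediction. The function name, the agreement-plus-confidence rule, and the threshold `tau` are illustrative choices consistent with the abstract, not the paper's exact selection criterion.

```python
import torch
import torch.nn.functional as F

def select_pseudo_labels(logits_rgb, logits_depth, tau=0.9):
    """Illustrative modality-cooperation rule (hypothetical, not the paper's
    exact criterion): keep a target sample only if the modality-specific
    classifiers agree on the class and every view is confident.

    logits_rgb, logits_depth: (N, C) unnormalized scores from the two
    modality-specific classifiers on unlabeled target samples.
    Returns indices of the selected samples and their pseudo-labels.
    """
    p_rgb = F.softmax(logits_rgb, dim=1)
    p_depth = F.softmax(logits_depth, dim=1)

    conf_rgb, pred_rgb = p_rgb.max(dim=1)
    conf_depth, pred_depth = p_depth.max(dim=1)

    # Modality-integrated prediction: average the two posteriors.
    p_fused = 0.5 * (p_rgb + p_depth)
    conf_fused, pred_fused = p_fused.max(dim=1)

    # Cooperation rule: the modality-specific predictions must agree with
    # each other and with the fused (modality-integrated) prediction, and
    # every view must exceed the confidence threshold.
    agree = (pred_rgb == pred_depth) & (pred_rgb == pred_fused)
    confident = (conf_rgb > tau) & (conf_depth > tau) & (conf_fused > tau)
    keep = agree & confident

    idx = keep.nonzero(as_tuple=True)[0]
    return idx, pred_fused[idx]
```

In a progressive scheme, samples selected this way would be added to the training pool over several rounds, typically with the threshold relaxed as the classifiers improve on the target domain.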
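The MMG objective described above combines adversarial domain alignment with semantics preservation. The sketch below, again only illustrative, assumes a generator has already produced the missing depth modality for source and target samples; `disc`, `clf`, and the particular confidence-weighted pseudo-label term are hypothetical stand-ins for the paper's networks and loss.

```python
import torch
import torch.nn.functional as F

def mmg_losses(fake_depth_src, fake_depth_tgt, disc, clf,
               src_labels, tgt_pseudo_labels, tgt_weights):
    """Illustrative MMG-style objective (a sketch under our assumptions,
    not the paper's exact formulation), with two terms:
      1) an adversarial loss so generated target-domain depth looks
         indistinguishable from source-domain depth (domain alignment);
      2) a semantic-preservation loss: cross-entropy on source labels plus a
         confidence-weighted cross-entropy on target pseudo-labels.

    disc: domain discriminator returning one logit per sample (1 = source).
    clf:  classifier on the generated modality returning (N, C) logits.
    tgt_weights: (N,) confidence weights of the target pseudo-labels.
    """
    # Adversarial term (generator side): fool the discriminator into
    # scoring target-domain generations as source-like.
    d_tgt = disc(fake_depth_tgt)
    adv_loss = F.binary_cross_entropy_with_logits(
        d_tgt, torch.ones_like(d_tgt))

    # Semantic term: the generated modality must still support recognition.
    ce_src = F.cross_entropy(clf(fake_depth_src), src_labels)
    ce_tgt = F.cross_entropy(clf(fake_depth_tgt), tgt_pseudo_labels,
                             reduction='none')
    ce_tgt = (tgt_weights * ce_tgt).mean()  # weight by pseudo-label confidence

    return adv_loss + ce_src + ce_tgt
```

Weighting the target term by pseudo-label confidence keeps unreliable pseudo-labels from dominating the semantic constraint, which matches the abstract's description of conditioning on weighted pseudo semantic class labels.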