Multi-EPL: Accurate multi-source domain adaptation

PLoS One. 2021 Aug 5;16(8):e0255754. doi: 10.1371/journal.pone.0255754. eCollection 2021.

Abstract

Given multiple source datasets with labels, how can we train a target model with no labeled data? Multi-source domain adaptation (MSDA) aims to train a model using multiple source datasets different from a target dataset in the absence of target data labels. MSDA is a crucial problem applicable to many practical cases where labels for the target data are unavailable due to privacy issues. Existing MSDA frameworks are limited since they align data without considering labels of the features of each domain. They also do not fully utilize the target data without labels and rely on limited feature extraction with a single extractor. In this paper, we propose Multi-EPL, a novel method for MSDA. Multi-EPL exploits label-wise moment matching to align the conditional distributions of the features for the labels, uses pseudolabels for the unavailable target labels, and introduces an ensemble of multiple feature extractors for accurate domain adaptation. Extensive experiments show that Multi-EPL provides the state-of-the-art performance for MSDA tasks in both image domains and text domains, improving the accuracy by up to 13.20%.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Database Management Systems / standards*
  • Datasets as Topic / standards
  • Deep Learning*

Grants and funding

This work was supported by Institute of Information & communications Technology Planning & Evaluation(IITP) grant funded by the Korea government(MSIT) (2020-0-00894, Flexible and Efficient Model Compression Method for Various Applications and Environments). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. The Institute of Engineering Research and ICT at Seoul National University provided research facilities for this work.