Recent advances in unsupervised domain adaptation have shown that mitigating the domain divergence by extracting the domain-invariant representation could significantly improve the generalization of a model to an unlabeled data domain. Nevertheless, the existing methods fail to effectively preserve the representation that is private to the label-missing domain, which could adversely affect the generalization. In this paper, we propose an approach to preserve such representation so that the latent distribution of the unlabeled domain could represent both the domain-invariant features and the individual characteristics that are private to the unlabeled domain. In particular, we demonstrate that maximizing the mutual information between the unlabeled domain and its latent space while mitigating the domain divergence can achieve such preservation. We also theoretically and empirically validate that preserving the representation that is private to the unlabeled domain is important and of necessity for the cross-domain generalization. Our approach outperforms state-of-the-art methods on several public datasets.
翻译:未受监督的领域适应方面的最新进展表明,通过提取域变量代表法来缩小域差异,可以大大改善将模型推广到无标签数据域的范围,然而,现有方法未能有效保存标签漏记域的私有代表法,这可能对通用性产生不利影响。在本文件中,我们提出一种方法来保存这种代表法,使未标签域的潜在分布既代表域变量特征,也代表未标签域的私有特性。特别是,我们证明尽可能扩大未标签域与其潜在空间之间的相互信息,同时减少域差异,可以实现这种保存。我们还从理论上和实验上证实,维护未标签漏记域的私有代表法对于跨域的通用性很重要,也是必要的。我们的方法超越了几个公共数据集的状态方法。