We use information-theoretic tools to derive a novel analysis of Multi-source Domain Adaptation (MDA) from the representation learning perspective. Concretely, we study joint distribution alignment for supervised MDA with few target labels and unsupervised MDA with pseudo labels, where the latter is relatively hard and less commonly studied. We further provide algorithm-dependent generalization bounds for these two settings, where the generalization is characterized by the mutual information between the parameters and the data. Then we propose a novel deep MDA algorithm, implicitly addressing the target shift through joint alignment. Finally, the mutual information bounds are extended to this algorithm providing a non-vacuous gradient-norm estimation. The proposed algorithm has comparable performance to the state-of-the-art on target-shifted MDA benchmark with improved memory efficiency.
翻译:我们使用信息论工具从表示学习的角度推导多源领域自适应(MDA)的新分析。具体而言,我们研究有少量目标标签的有监督 MDA,以及具有伪标签的无监督 MDA,后者比较困难且不太常见。我们进一步提供了这两种情况下的算法相关的泛化下界,其中泛化性能是由参数和数据之间的互信息刻画的。然后,我们提出了一种新颖的深度 MDA 算法,通过联合对齐隐式解决目标偏移问题。最后,我们将互信息下界扩展到该算法上,提供了一个非平凡的梯度范数估计。该算法在具有改进的内存效率的目标位移 MDA 基准上具有可比较的良好性能,与现有最先进算法相当。