Domain adaptation has received a lot of attention in recent years, and many algorithms have been proposed with impressive progress. However, it is still not fully explored concerning the joint probability distribution (P(X, Y)) distance for this problem, since its empirical estimation derived from the maximum mean discrepancy (joint maximum mean discrepancy, JMMD) will involve complex tensor-product operator that is hard to manipulate. To solve this issue, this paper theoretically derives a unified form of JMMD that is easy to optimize, and proves that the marginal, class conditional and weighted class conditional probability distribution distances are our special cases with different label kernels, among which the weighted class conditional one not only can realize feature alignment across domains in the category level, but also deal with imbalance dataset using the class prior probabilities. From the revealed unified JMMD, we illustrate that JMMD degrades the feature-label dependence (discriminability) that benefits to classification, and it is sensitive to the label distribution shift when the label kernel is the weighted class conditional one. Therefore, we leverage Hilbert Schmidt independence criterion and propose a novel MMD matrix to promote the dependence, and devise a novel label kernel that is robust to label distribution shift. Finally, we conduct extensive experiments on several cross-domain datasets to demonstrate the validity and effectiveness of the revealed theoretical results.
翻译:近些年来,对网域的适应引起了很多关注,许多算法都提出了令人印象深刻的进展。然而,对于这一问题的共同概率分布(P(X),Y)距离,仍未充分探讨这一问题的共同概率分布(P(X),Y)距离,因为根据最大平均值差异(联合最大平均值差异,JMD)得出的实证估计将涉及难以操纵的复杂的高压产品操作者。为了解决这个问题,本文件理论上产生了一种易于优化的统一的JMMD格式,并证明边际、等级有条件和加权等级有条件概率分布距离是我们使用不同标签内核的特殊案例,其中加权舱不仅能够实现类别一级跨域的特征协调,而且还能够利用先前的类别概率处理不平衡数据集。从已披露的统一JMD中,我们说明,JMMD将降低对分类的好处(差异性)的特性标签依赖度,当标签内核为加权等级时,对标签分配变化十分敏感。因此,我们利用了Hilbert Schmid 独立性标准,并提出了新的MMD矩阵标准,不仅可以实现跨类的特征调整,我们最终展示了对数据库的可靠性。