Unsupervised domain adaptation for classification tasks has achieved great progress in leveraging the knowledge in a labeled (source) domain to improve the task performance in an unlabeled (target) domain by mitigating the effect of distribution discrepancy. However, most existing methods can only handle unsupervised closed set domain adaptation (UCSDA), where the source and target domains share the same label set. In this paper, we target a more challenging but realistic setting: unsupervised open set domain adaptation (UOSDA), where the target domain has unknown classes that the source domain does not have. This study is the first to give the generalization bound of open set domain adaptation through theoretically investigating the risk of the target classifier on the unknown classes. The proposed generalization bound for open set domain adaptation has a special term, namely open set difference, which reflects the risk of the target classifier on unknown classes. According to this generalization bound, we propose a novel and theoretically guided unsupervised open set domain adaptation method: Distribution Alignment with Open Difference (DAOD), which is based on the structural risk minimization principle and open set difference regularization. The experiments on several benchmark datasets show the superior performance of the proposed UOSDA method compared with the state-of-the-art methods in the literature.
翻译:对分类任务进行不受监督的域适应工作,在利用标签(源)域的知识,通过减轻分布差异的影响,提高未标签(目标)域的任务性业绩方面取得了很大进展。然而,大多数现有方法只能处理未经监督的封闭的域适应(UCSDA),因为源和目标域共用相同的标签组。在本文件中,我们的目标是更具挑战性但更现实的设置:不受监督的开放的域适应(UOSDA),目标域有源域没有的未知类别。本项研究首先通过理论上调查不明类别目标分类师的风险,使开放的域适应(目标)域适应工作普遍化。为开放的域调整提议的通用定义有一个特殊术语,即开放的域调整(UCSDA),反映了目标分类在未知类别上的风险。根据这一总体约束,我们提出了一个新颖的和理论上指导的、未经统一但不受监督的开放的域适应方法:与公开差异(DAODODD)相匹配,这是基于结构风险最小化原则和开放设定差异规范。在几个基准数据组中进行的实验显示O-SDADA中的拟议方法的高级性。