The goal of this paper is to design active learning strategies that lead to domain adaptation under domain shift, assuming a Lipschitz labeling function. Building on the work of Mansour et al. (2009), we adapt the notion of discrepancy distance between source and target distributions by restricting the maximization over the hypothesis class to a localized class of functions that label the source domain accurately. We derive generalization error bounds for such active learning strategies in terms of Rademacher averages and the localized discrepancy, for general loss functions satisfying a regularity condition. From these theoretical bounds we infer practical algorithms: one based on greedy optimization, the other on K-medoids clustering. We also provide improved versions of both algorithms to handle large data sets. As our numerical experiments show, these algorithms are competitive with other state-of-the-art active learning techniques for domain adaptation, in particular on large data sets of around one hundred thousand images.
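As a rough illustration of the K-medoids idea mentioned above (selecting k representative points that minimize total distance to the rest), here is a minimal sketch in plain Python. The function name, the alternating assign/update scheme, and the toy 1-D data are illustrative assumptions, not the paper's actual selection algorithm.

```python
import random

def k_medoids(points, k, dist, n_iter=100, seed=0):
    """Naive K-medoids: alternate between assigning points to their
    nearest medoid and re-electing each cluster's best representative."""
    rng = random.Random(seed)
    medoids = rng.sample(range(len(points)), k)
    for _ in range(n_iter):
        # assignment step: attach every point to its closest medoid
        clusters = {m: [] for m in medoids}
        for i in range(len(points)):
            nearest = min(medoids, key=lambda m: dist(points[i], points[m]))
            clusters[nearest].append(i)
        # update step: within each cluster, pick the member that
        # minimizes the total distance to the other members
        new_medoids = []
        for members in clusters.values():
            best = min(members,
                       key=lambda c: sum(dist(points[c], points[j])
                                         for j in members))
            new_medoids.append(best)
        if set(new_medoids) == set(medoids):
            break  # converged: the medoid set is stable
        medoids = new_medoids
    return sorted(medoids)

# toy usage: two well-separated 1-D clusters; the selected medoids
# are the central points of each cluster
points = [0.0, 0.1, 0.2, 10.0, 10.1, 10.2]
selected = k_medoids(points, 2, lambda a, b: abs(a - b))
```

In an active-learning setting, the selected medoids would be the samples submitted for labeling; scalable variants (as the abstract suggests for large data sets) typically subsample candidates or cache pairwise distances.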