Active Learning Through a Covering Lens

Deep active learning aims to reduce the annotation cost of training deep models, which are notoriously data-hungry. Until recently, deep active learning methods were ineffective in the low-budget regime, where only a small number of examples are annotated. The situation has been alleviated by recent advances in representation and self-supervised learning, which endow the geometry of the data representation with rich information about the points. Taking advantage of this progress, we study the problem of subset selection for annotation through a "covering" lens, proposing ProbCover, a new active learning algorithm for the low-budget regime that seeks to maximize Probability Coverage. We then describe a dual way to view the proposed formulation, from which one can derive strategies suitable for the high-budget regime of active learning, related to existing methods like Coreset. We conclude with extensive experiments, evaluating ProbCover in the low-budget regime. We show that our principled active learning strategy improves the state of the art in the low-budget regime on several image recognition benchmarks. The method is especially beneficial in the semi-supervised setting, allowing state-of-the-art semi-supervised methods to match the performance of fully supervised methods while using far fewer labels. Code is available at https://github.com/avihu111/TypiClust.
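The abstract does not spell out the selection procedure, so the following is only an illustrative sketch of what a "covering" view of subset selection could look like: greedily pick the points whose fixed-radius balls in embedding space cover the most not-yet-covered points. The function name `greedy_cover`, the parameters `embeddings`, `budget`, and `delta`, and the choice of Euclidean distance are all assumptions made for this example; this is not the authors' implementation, which is in the repository linked above.

```python
import numpy as np


def greedy_cover(embeddings, budget, delta):
    """Greedily pick `budget` points whose delta-balls cover the most points.

    Hypothetical helper for illustration only; all names are assumptions.
    Returns the selected indices and the fraction of points covered.
    """
    x = np.asarray(embeddings, dtype=float)
    # Pairwise Euclidean distances (O(n^2) memory, acceptable for a sketch).
    dists = np.linalg.norm(x[:, None, :] - x[None, :, :], axis=-1)
    in_ball = dists <= delta  # in_ball[i, j]: point j lies in the ball around i
    covered = np.zeros(len(x), dtype=bool)
    picked = np.zeros(len(x), dtype=bool)
    selected = []
    for _ in range(budget):
        # How many still-uncovered points each candidate ball would add.
        gains = (in_ball & ~covered).sum(axis=1)
        gains[picked] = -1  # never pick the same point twice
        best = int(np.argmax(gains))
        selected.append(best)
        picked[best] = True
        covered |= in_ball[best]
    return selected, float(covered.mean())
```

A possible usage, assuming `features` holds self-supervised embeddings as an `(n, d)` array: `indices, coverage = greedy_cover(features, budget=10, delta=0.5)`. The points in `indices` would then be sent for annotation. The radius `delta` controls the trade-off between how many points each ball covers and how similar the covered points are to the labeled center; how to set it is not stated in the abstract.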


