AutoWS-Bench-1011:以100个标签作为自动薄弱监督基准 (AutoWS-Bench-101: Benchmarking Automated Weak Supervision with 100 Labels)

Weak supervision (WS) is a powerful method to build labeled datasets for training supervised models in the face of little-to-no labeled data. It replaces hand-labeling data with aggregating multiple noisy-but-cheap label estimates expressed by labeling functions (LFs). While it has been used successfully in many domains, weak supervision's application scope is limited by the difficulty of constructing labeling functions for domains with complex or high-dimensional features. To address this, a handful of methods have proposed automating the LF design process using a small set of ground truth labels. In this work, we introduce AutoWS-Bench-101: a framework for evaluating automated WS (AutoWS) techniques in challenging WS settings -- a set of diverse application domains on which it has been previously difficult or impossible to apply traditional WS techniques. While AutoWS is a promising direction toward expanding the application-scope of WS, the emergence of powerful methods such as zero-shot foundation models reveals the need to understand how AutoWS techniques compare or cooperate with modern zero-shot or few-shot learners. This informs the central question of AutoWS-Bench-101: given an initial set of 100 labels for each task, we ask whether a practitioner should use an AutoWS method to generate additional labels or use some simpler baseline, such as zero-shot predictions from a foundation model or supervised learning. We observe that in many settings, it is necessary for AutoWS methods to incorporate signal from foundation models if they are to outperform simple few-shot baselines, and AutoWS-Bench-101 promotes future research in this direction. We conclude with a thorough ablation study of AutoWS methods.

翻译：微弱监管 (WS) 是一种强大的方法, 用来在面临低到无标签的标签数据的情况下对受监督模型进行培训。它用标签功能( LF) 表达的多重噪音和便宜标签估计来取代手贴标签数据。虽然它在许多领域被成功使用, 薄弱的监督应用范围却由于难以为具有复杂或高维特征的领域构建标签功能而受到限制。为了解决这个问题, 少数方法提议使用少量的地面真相标签来自动调整LF 101 设计进程。在这项工作中, 我们引入了AutoWS- Bench- 101: 一个用于在挑战性的WS设置中评估自动自定义(AutoWS)技术(AutoWS) (AutoWS) (AutoWS) 技术(AutoWS) 10: 一组不同的应用领域过去很难或不可能应用传统的WSWS技术。虽然AWA是扩大WS应用范围, 但是, 强大的方法的出现表明他们需要理解AutoWS技术如何与现代零光或少得手的学习者进行对比。。这让AAAWS- Besh- besh- scar- hust- hust- 10 基础的中央问题从观察基础, 成为一个更简单的基础, 我们使用一个基础, 一种基础, 一种基础, 一种更简单的任务是使用一个基础, 一种基础, 一种基础。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日