WeaNF: Weak Supervision with Normalizing Flows


2022 年 4 月 28 日


Andreas Stephan,Benjamin Roth

A popular approach to decreasing the need for costly manual annotation of large data sets is weak supervision, which introduces problems of noisy labels, coverage, and bias. Methods for overcoming these problems have relied either on discriminative models trained with cost functions specific to weak supervision or, more recently, on generative models that try to model the output of the automatic annotation process. In this work, we explore a novel direction of generative modeling for weak supervision: instead of modeling the output of the annotation process (the labeling function matches), we generatively model the input-side data distributions (the feature space) covered by labeling functions. Specifically, we estimate a density for each weak labeling source, or labeling function, using normalizing flows. An integral part of our method is the flow-based modeling of multiple simultaneously matching labeling functions, so that phenomena such as labeling function overlap and correlations are captured. We analyze the effectiveness and modeling capabilities on various commonly used weak supervision data sets, and show that weakly supervised normalizing flows compare favorably to standard weak supervision baselines.
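To make the idea of "one density per labeling function" concrete, here is a minimal sketch. It is not the authors' architecture: it assumes one-dimensional features, a single affine flow layer z = (x - mu) / sigma with a standard-normal base distribution, and closed-form maximum-likelihood fitting. The labeling function names `lf_a` and `lf_b`, the toy data, and all function names are hypothetical. The density follows the change-of-variables rule log p(x) = log p_Z(z) + log |dz/dx|.

```python
import math

def fit_affine_flow(xs):
    """Fit the affine flow z = (x - mu) / sigma by maximum likelihood.

    With a standard-normal base density, the optimum is the sample mean
    and the (biased) sample standard deviation.
    """
    n = len(xs)
    mu = sum(xs) / n
    var = sum((x - mu) ** 2 for x in xs) / n
    sigma = math.sqrt(var) if var > 0 else 1e-6  # guard against zero spread
    return mu, sigma

def log_density(x, flow):
    """Log-density of x under the flow via the change-of-variables rule."""
    mu, sigma = flow
    z = (x - mu) / sigma
    log_base = -0.5 * z * z - 0.5 * math.log(2 * math.pi)  # log N(z; 0, 1)
    log_det = -math.log(sigma)                             # log |dz/dx|
    return log_base + log_det

# Hypothetical toy data: feature values of the inputs each labeling
# function matched.
matched = {
    "lf_a": [0.9, 1.1, 1.0, 1.2, 0.8],
    "lf_b": [3.9, 4.2, 4.0, 4.1, 3.8],
}

# One density estimator per labeling function.
flows = {name: fit_affine_flow(xs) for name, xs in matched.items()}

def most_likely_lf(x):
    """Return the labeling function whose density assigns x the highest value."""
    return max(flows, key=lambda name: log_density(x, flows[name]))
```

Calling `most_likely_lf(1.0)` compares the input against both fitted densities. A practical system would replace the affine layer with a deeper invertible network over high-dimensional features, and the abstract's joint modeling of simultaneously matching labeling functions is not covered by this sketch.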

