To reduce human annotation effort, the programmatic weak supervision (PWS) paradigm abstracts weak supervision sources as labeling functions (LFs) and uses a label model to aggregate the outputs of multiple LFs into training labels. Most existing label models require a parameter learning step for each dataset. In this work, we present a hyper label model that (once learned) infers the ground-truth labels for each dataset in a single forward pass, without dataset-specific parameter learning. The hyper label model approximates an optimal analytical (yet computationally intractable) solution for the ground-truth labels. We train the model on synthetic data generated in a way that ensures the model approximates this analytical optimal solution, and we build the model on a Graph Neural Network (GNN) so that its predictions are invariant to permutations of the LFs and equivariant to permutations of the data points. On 14 real-world datasets, our hyper label model outperforms the best existing methods in both accuracy (by 1.4 points on average) and efficiency (by six times on average). Our code is available at https://github.com/wurenzhi/hyper_label_model.
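To make the permutation properties concrete, below is a minimal sketch (not the authors' architecture, which uses a GNN over the LF output matrix) of a label model that maps an LF vote matrix of shape (n_points, n_LFs) to per-point label scores in one forward pass. Mean-pooling over the LF axis makes the prediction invariant to LF order, and applying the same shared network to every data point makes it equivariant to the order of data points. The layer sizes, the vote encoding {-1 (abstain), 0, 1}, and all names are illustrative assumptions.

```python
# Illustrative sketch only: a tiny permutation-invariant/equivariant label model.
import torch
import torch.nn as nn


class TinyHyperLabelModel(nn.Module):
    def __init__(self, hidden_dim: int = 32, num_classes: int = 2):
        super().__init__()
        # Embed each LF vote (abstain / class 0 / class 1) into a vector.
        self.vote_embed = nn.Embedding(3, hidden_dim)
        # Shared per-point head applied after pooling over LFs
        # (same weights for every data point -> equivariant to point order).
        self.point_mlp = nn.Sequential(
            nn.Linear(hidden_dim, hidden_dim), nn.ReLU(),
            nn.Linear(hidden_dim, num_classes),
        )

    def forward(self, lf_votes: torch.Tensor) -> torch.Tensor:
        # lf_votes: (n_points, n_LFs) with entries in {-1, 0, 1};
        # shift to {0, 1, 2} so they index the embedding table.
        emb = self.vote_embed(lf_votes + 1)      # (n_points, n_LFs, hidden)
        pooled = emb.mean(dim=1)                 # pooling makes it LF-order invariant
        return self.point_mlp(pooled)            # (n_points, num_classes)


if __name__ == "__main__":
    votes = torch.randint(-1, 2, (5, 7))         # 5 data points, 7 LFs
    model = TinyHyperLabelModel()
    scores = model(votes)
    # Permuting the LF columns leaves the prediction (essentially) unchanged.
    perm = torch.randperm(votes.shape[1])
    assert torch.allclose(scores, model(votes[:, perm]), atol=1e-6)
```

In the paper's setting, the GNN message passing over the bipartite structure of data points and LFs plays the role of the simple mean-pooling used here.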