Training deep neural networks (DNNs) with limited supervision has been a popular research topic, as it can significantly alleviate the annotation burden. Self-training has been successfully applied to semi-supervised learning tasks, but one drawback is its vulnerability to label noise from incorrect pseudo labels. Inspired by the fact that samples with similar labels tend to share similar representations, we develop a neighborhood-based sample selection approach to tackle the issue of noisy pseudo labels. We further stabilize self-training by aggregating the predictions from different rounds during sample selection. Experiments on eight tasks show that our proposed method outperforms the strongest self-training baseline by 1.83% and 2.51% on average on text and graph datasets, respectively. Our further analysis demonstrates that our proposed data selection strategy reduces the noise of pseudo labels by 36.8% and saves 57.3% of the runtime compared with the best baseline. Our code and appendices will be uploaded to https://github.com/ritaranx/NeST.
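To make the two ideas in the abstract concrete, below is a minimal sketch of neighborhood-based selection combined with cross-round prediction aggregation. The function name, the k-nearest-neighbor agreement rule, the simple mean over rounds, and all thresholds are illustrative assumptions for exposition, not the exact NeST procedure described in the paper.

```python
import numpy as np

def select_pseudo_labels(emb_unlab, probs_by_round, emb_lab, y_lab,
                         k=5, agree_thresh=0.6):
    """Keep pseudo-labeled samples whose embedding-space neighbors
    agree with their pseudo label.

    emb_unlab:      (n_unlab, d) embeddings of unlabeled samples.
    probs_by_round: list of (n_unlab, n_classes) prediction matrices,
                    one per self-training round; averaging them is one
                    simple way to aggregate predictions across rounds.
    emb_lab, y_lab: embeddings and gold labels of labeled samples.
    """
    # Aggregate predictions from different rounds (mean as a stand-in
    # for the paper's aggregation) and derive pseudo labels.
    probs = np.mean(probs_by_round, axis=0)
    pseudo = probs.argmax(axis=1)

    keep = []
    for i, z in enumerate(emb_unlab):
        # Find the k nearest labeled samples by Euclidean distance.
        dists = np.linalg.norm(emb_lab - z, axis=1)
        nn = np.argsort(dists)[:k]
        # Keep the sample only if enough neighbors share its pseudo
        # label, filtering out likely-noisy pseudo labels.
        if np.mean(y_lab[nn] == pseudo[i]) >= agree_thresh:
            keep.append(i)
    return np.array(keep, dtype=int), pseudo
```

Under these assumptions, the selected indices and their pseudo labels would then be added to the training set for the next self-training round.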