Task-adaptive Pre-training and Self-training are Complementary for Natural Language Understanding - 专知论文


February 19, 2023

Task-adaptive Pre-training and Self-training are Complementary for Natural Language Understanding


Shiyang Li, Semih Yavuz, Wenhu Chen, Xifeng Yan

From arXiv, Findings of EMNLP 2021

Task-adaptive pre-training (TAPT) and self-training (ST) have emerged as the major semi-supervised approaches for improving natural language understanding (NLU) tasks with massive amounts of unlabeled data. However, it is unclear whether they learn similar representations or whether they can be effectively combined. In this paper, we show that TAPT and ST can be complementary under a simple protocol that follows the TAPT -> Finetuning -> Self-training (TFS) process. Experimental results show that the TFS protocol can effectively utilize unlabeled data to achieve strong combined gains consistently across six datasets covering sentiment classification, paraphrase identification, natural language inference, named entity recognition, and dialogue slot classification. We investigate various semi-supervised settings and consistently show that gains from TAPT and ST can be strongly additive when following the TFS procedure. We hope that TFS can serve as an important semi-supervised baseline for future NLP studies.
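The ordering of the three TFS stages can be sketched as a small orchestration function. This is only an illustrative skeleton, not the authors' implementation: the names `tfs`, `toy_tapt`, `toy_finetune`, and `toy_predict` are hypothetical, the "model" is a toy tuple that merely records which stages it has passed through, and two details are assumptions that the abstract does not specify (the student restarts from the TAPT checkpoint, and it trains on pseudo-labeled plus gold-labeled data together).

```python
from typing import Callable, Sequence, Tuple

# Toy "model": a tuple recording the training stages applied so far.
# A real setup would use a pre-trained language model checkpoint instead.
Model = Tuple[str, ...]
LabeledData = Sequence[Tuple[str, int]]


def tfs(
    base: Model,
    labeled: LabeledData,
    unlabeled: Sequence[str],
    tapt: Callable[[Model, Sequence[str]], Model],
    finetune: Callable[[Model, LabeledData], Model],
    predict: Callable[[Model, str], int],
) -> Model:
    """Run TAPT -> Finetuning -> Self-training and return the student."""
    # Stage 1 (TAPT): continue pre-training on the task's unlabeled text.
    adapted = tapt(base, unlabeled)
    # Stage 2 (Finetuning): fit a teacher on the gold-labeled examples.
    teacher = finetune(adapted, labeled)
    # Stage 3 (Self-training): the teacher pseudo-labels the unlabeled text.
    pseudo = [(text, predict(teacher, text)) for text in unlabeled]
    # Assumption: the student restarts from the TAPT checkpoint and sees
    # pseudo-labeled and gold-labeled data together.
    return finetune(adapted, pseudo + list(labeled))


# Hypothetical stand-ins so the skeleton runs without any ML library.
def toy_tapt(model: Model, texts: Sequence[str]) -> Model:
    return model + (f"tapt[{len(texts)}]",)


def toy_finetune(model: Model, data: LabeledData) -> Model:
    return model + (f"finetune[{len(data)}]",)


def toy_predict(model: Model, text: str) -> int:
    return len(text) % 2  # placeholder "classifier"


labeled = [("good movie", 1), ("bad movie", 0)]
unlabeled = ["fine", "awful plot", "great cast"]
student = tfs((), labeled, unlabeled, toy_tapt, toy_finetune, toy_predict)
print(student)  # ('tapt[3]', 'finetune[5]')
```

In a real experiment, the three callables would wrap masked-language-model training on the unlabeled text, supervised fine-tuning, and inference, respectively; the skeleton only fixes the order in which they are applied.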

