Multi-Task Pre-Training of Modular Prompt for Few-Shot Learning - 专知 Papers


Prompt · tuning · Learning · Few-Shot Learning

October 24, 2022

Multi-Task Pre-Training of Modular Prompt for Few-Shot Learning


Tianxiang Sun, Zhengfu He, Qin Zhu, Xipeng Qiu, Xuanjing Huang

From arXiv. Code and data are publicly available at https://github.com/Hzfinfdu/MPMP

Prompt tuning is a parameter-efficient approach to adapting pre-trained language models to downstream tasks. Although prompt tuning has been shown to match the performance of full model tuning when training data is sufficient, it tends to struggle in few-shot learning settings. In this paper, we present Multi-task Pre-trained Modular Prompt (MP2) to boost prompt tuning for few-shot learning. MP2 is a set of combinable prompts pre-trained on 38 Chinese tasks. On downstream tasks, the pre-trained prompts are selectively activated and combined, leading to strong compositional generalization to unseen tasks. To bridge the gap between pre-training and fine-tuning, we formulate upstream and downstream tasks into a unified machine reading comprehension task. Extensive experiments under two learning paradigms, i.e., gradient descent and black-box tuning, show that MP2 significantly outperforms prompt tuning, full model tuning, and prior prompt pre-training methods in few-shot settings. In addition, we demonstrate that MP2 can achieve surprisingly fast and strong adaptation to downstream tasks by merely learning 8 parameters to combine the pre-trained modular prompts.
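The idea of adapting to a new task by learning only a handful of combination weights over frozen modular prompts can be illustrated with a minimal sketch. This is not the authors' implementation. The abstract does not state the combination rule, so a plain weighted sum is assumed here. The number of modules (8, chosen to match the "8 parameters" mentioned above), the prompt length, the hidden size, the random stand-in prompts, and the toy squared-error objective are all hypothetical.

```python
import numpy as np


def combine_prompts(modular_prompts, weights):
    """Weighted sum of K frozen prompts: shapes (K, L, D) and (K,) -> (L, D)."""
    return np.tensordot(weights, modular_prompts, axes=1)


rng = np.random.default_rng(0)
K, L, D = 8, 4, 16  # hypothetical: 8 modules, prompt length 4, hidden size 16

# Stand-in for the frozen pre-trained modular prompts (random here).
modular_prompts = rng.normal(size=(K, L, D))

# The only trainable parameters: one scalar weight per module.
weights = np.zeros(K)

# Toy objective (an assumption for illustration): recover a target prompt
# that is itself a mixture of modules 1 and 5.
target = modular_prompts[1] + 0.5 * modular_prompts[5]

for _ in range(500):
    residual = combine_prompts(modular_prompts, weights) - target
    # Gradient of 0.5 * ||residual||^2 with respect to each module weight.
    grad = np.tensordot(modular_prompts, residual, axes=([1, 2], [0, 1]))
    weights -= 0.005 * grad

final_loss = 0.5 * np.sum((combine_prompts(modular_prompts, weights) - target) ** 2)
print("learned weights:", np.round(weights, 3))
print("final loss:", final_loss)
```

In a real setting the loss would come from the language model's output on few-shot examples rather than from a known target prompt. The point of the sketch is only that the prompts stay fixed while a vector of K scalars is optimized.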

