Parameter-efficient methods are able to use a single frozen pre-trained large language model (LLM) to perform many tasks by learning task-specific soft prompts that modulate model behavior when concatenated to the input text. However, these learned prompts are tightly coupled to a given frozen model -- if the model is updated, corresponding new prompts need to be obtained. In this work, we propose and investigate several approaches to ``Prompt Recycling'', where a prompt trained on a source model is transformed to work with a new target model. Our methods do not rely on supervised pairs of prompts, task-specific data, or training updates with the target model, which would be just as costly as re-tuning prompts with the target model from scratch. We show that recycling between models is possible (our best settings are able to successfully recycle $88.9\%$ of prompts, producing a prompt that outperforms baselines), but significant performance headroom remains, requiring improved recycling techniques.
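To make the setting concrete, the sketch below shows one simple way a recycler satisfying the constraints above could be built: fit a linear map between the two models' vocabulary embedding tables and apply it to the tuned soft prompt, with no task data and no training updates on the target model. This is an illustrative sketch under those assumptions, not necessarily the exact recipe studied in the paper; all names (\texttt{E\_src}, \texttt{E\_tgt}, \texttt{prompt\_src}) are hypothetical.

\begin{verbatim}
# Hedged sketch: recycle a soft prompt from a source model to a target
# model by fitting a linear map between their vocabulary embedding
# tables (shared vocabulary assumed). Illustrative only; variable names
# are hypothetical.
import numpy as np

def fit_vocab_map(E_src: np.ndarray, E_tgt: np.ndarray) -> np.ndarray:
    """Least-squares linear map W such that E_src @ W ~= E_tgt.

    E_src: [vocab, d_src] source-model token embeddings.
    E_tgt: [vocab, d_tgt] target-model token embeddings.
    """
    W, *_ = np.linalg.lstsq(E_src, E_tgt, rcond=None)
    return W  # [d_src, d_tgt]

def recycle_prompt(prompt_src: np.ndarray, W: np.ndarray) -> np.ndarray:
    """Project a tuned soft prompt [prompt_len, d_src] into target space."""
    return prompt_src @ W  # [prompt_len, d_tgt]

# Usage: the recycled prompt is concatenated to the target model's input
# embeddings at inference time, with no further training.
\end{verbatim}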