Recent parameter-efficient language model tuning (PELT) methods manage to match the performance of fine-tuning with far fewer trainable parameters and perform especially well when training data is limited. However, different PELT methods may perform rather differently on the same task, making it nontrivial to select the most appropriate method for a specific task, especially given the fast-growing number of new PELT methods and tasks. In light of model diversity and the difficulty of model selection, we propose a unified framework, UniPELT, which incorporates different PELT methods as submodules and learns to activate the ones that best suit the current data or task setup via a gating mechanism. On the GLUE benchmark, UniPELT consistently achieves 1~4% gains over the best individual PELT method that it incorporates and even outperforms fine-tuning under different setups. Moreover, UniPELT generally surpasses the upper bound obtained by taking the best performance of each of its submodules used individually on each task, indicating that a mixture of multiple PELT methods may be inherently more effective than any single method.
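To make the gating idea concrete, the following is a minimal PyTorch sketch of how learned gates could scale the contribution of each PELT submodule inside a Transformer layer. It is not the authors' implementation: the class names (ToyAdapterUpdate, ToyLoRAUpdate, GatedPELTLayer), the use of only two submodules, the scalar-gate parameterization, and the mean-pooled gate input are illustrative assumptions for this sketch.

```python
# Minimal sketch of gated PELT submodules (illustrative, not the UniPELT code).
# Assumptions: two trainable submodules, one sigmoid gate per submodule,
# gates predicted from the mean-pooled layer input.
import torch
import torch.nn as nn


class ToyAdapterUpdate(nn.Module):
    """Bottleneck adapter update: down-project, nonlinearity, up-project."""
    def __init__(self, hidden_dim: int, bottleneck_dim: int = 64):
        super().__init__()
        self.down = nn.Linear(hidden_dim, bottleneck_dim)
        self.up = nn.Linear(bottleneck_dim, hidden_dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.up(torch.relu(self.down(x)))


class ToyLoRAUpdate(nn.Module):
    """Low-rank update intended to be added to a frozen projection's output."""
    def __init__(self, hidden_dim: int, rank: int = 8):
        super().__init__()
        self.lora_a = nn.Linear(hidden_dim, rank, bias=False)
        self.lora_b = nn.Linear(rank, hidden_dim, bias=False)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.lora_b(self.lora_a(x))


class GatedPELTLayer(nn.Module):
    """Adds each submodule's trainable update to the hidden states, scaled by
    a gate predicted from the layer input, so training can learn which
    submodule(s) to rely on for the current task or data setup."""
    def __init__(self, hidden_dim: int):
        super().__init__()
        self.adapter = ToyAdapterUpdate(hidden_dim)
        self.lora = ToyLoRAUpdate(hidden_dim)
        self.gate_proj = nn.Linear(hidden_dim, 2)  # one scalar gate per submodule

    def forward(self, hidden: torch.Tensor) -> torch.Tensor:
        # hidden: (batch, seq_len, hidden_dim)
        gates = torch.sigmoid(self.gate_proj(hidden.mean(dim=1)))  # (batch, 2)
        g_lora = gates[:, 0].view(-1, 1, 1)
        g_adapter = gates[:, 1].view(-1, 1, 1)
        # Each gate scales how much of its submodule's update is added
        # to the frozen backbone's hidden states.
        hidden = hidden + g_lora * self.lora(hidden)
        hidden = hidden + g_adapter * self.adapter(hidden)
        return hidden


if __name__ == "__main__":
    layer = GatedPELTLayer(hidden_dim=768)
    x = torch.randn(4, 16, 768)
    print(layer(x).shape)  # torch.Size([4, 16, 768])
```

In this sketch, only the submodules and the gate projection are trainable; a gate near zero effectively deactivates its submodule, which is the behavior the abstract describes as learning to activate the submodules that best suit the current setup.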