The outstanding performance and growing size of large language models have drawn increasing attention to parameter-efficient learning. The two predominant approaches are adapters and pruning. Adapters freeze the model and attach a small trainable weight matrix on the side, which can significantly reduce training time and memory; the cost is that the added parameters increase time and memory consumption at evaluation and test time. Pruning removes some weights and redistributes the remaining ones, which trades extremely high training memory and time for relatively cheap evaluation and testing. Thus, training efficiency and inference efficiency cannot be obtained at the same time. In this work, we propose a task-oriented Pruning-Adapter method that achieves high training-memory efficiency, speeds up training, and incurs no significant drop in accuracy on GLUE tasks, achieving training and inference efficiency at the same time.
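To make the two approaches concrete, here is a minimal PyTorch sketch, our own illustration rather than the method proposed in this work: a bottleneck adapter that trains a small side branch beside a frozen linear layer, and a simple magnitude-pruning routine that zeroes the smallest weights. The class name, `bottleneck_dim`, and the 0.5 sparsity level are illustrative assumptions.

```python
# Illustrative sketch only: contrasts adapters and pruning on one linear layer.
import torch
import torch.nn as nn


class BottleneckAdapter(nn.Module):
    """Adapter: the base weight stays frozen; only a small low-rank
    side branch (down-project -> ReLU -> up-project) is trained."""

    def __init__(self, base: nn.Linear, bottleneck_dim: int = 16):
        super().__init__()
        self.base = base
        for p in self.base.parameters():  # freeze the pretrained weights
            p.requires_grad = False
        self.down = nn.Linear(base.out_features, bottleneck_dim)
        self.up = nn.Linear(bottleneck_dim, base.out_features)
        nn.init.zeros_(self.up.weight)    # start as an identity residual
        nn.init.zeros_(self.up.bias)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        h = self.base(x)
        # The side branch adds extra compute at inference time as well,
        # which is the inference-cost drawback noted above.
        return h + self.up(torch.relu(self.down(h)))


def magnitude_prune(layer: nn.Linear, sparsity: float = 0.5) -> None:
    """Pruning: zero out the smallest-magnitude weights in place.
    Deciding what to prune (and retraining) is the expensive part;
    the resulting sparse layer is cheap at evaluation and test time."""
    with torch.no_grad():
        w = layer.weight.abs().flatten()
        k = max(1, int(sparsity * w.numel()))
        threshold = w.kthvalue(k).values
        mask = layer.weight.abs() > threshold
        layer.weight.mul_(mask)


if __name__ == "__main__":
    layer = nn.Linear(768, 768)
    adapted = BottleneckAdapter(layer, bottleneck_dim=16)
    x = torch.randn(4, 768)
    print(adapted(x).shape)                    # torch.Size([4, 768])
    magnitude_prune(layer, sparsity=0.5)
    print((layer.weight == 0).float().mean())  # roughly 0.5
```

The sketch makes the trade-off visible: the adapter trains only the `down`/`up` parameters but keeps both branches at inference, while pruning pays its cost up front and leaves a sparser layer behind.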