Vision transformers have achieved competitive performance on a variety of computer vision applications. However, their storage, run-time memory, and computational demands hinder deployment on mobile devices. Here we present a vision transformer pruning approach, which identifies the impact of each dimension in every layer of the transformer and then executes pruning accordingly. By encouraging dimension-wise sparsity in the transformer, important dimensions emerge automatically. A large number of dimensions with small importance scores can be discarded to achieve a high pruning ratio without significantly compromising accuracy. The pipeline for vision transformer pruning is as follows: 1) training with sparsity regularization; 2) pruning the dimensions of the linear projections; 3) fine-tuning. The parameter and FLOPs reduction ratios of the proposed algorithm are evaluated and analyzed on the ImageNet dataset to demonstrate the effectiveness of our proposed method.
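Step 2 of the pipeline can be sketched as follows. This is a minimal, hypothetical illustration (not the authors' implementation): it assumes that per-dimension importance scores have already been learned under an L1 sparsity penalty during training, and it simply keeps the top-scoring output dimensions of a linear projection's weight matrix.

```python
import numpy as np

def prune_linear_dims(weight, scores, keep_ratio):
    """Prune the output dimensions of a linear projection by importance score.

    weight:     (out_dim, in_dim) matrix of a linear projection.
    scores:     (out_dim,) per-dimension importance scores, assumed to have
                been learned with an L1 sparsity regularizer.
    keep_ratio: fraction of dimensions to retain, in (0, 1].

    Returns the pruned weight matrix and the kept dimension indices.
    """
    out_dim = weight.shape[0]
    k = max(1, int(round(out_dim * keep_ratio)))
    # Indices of the k largest scores, kept in their original order.
    keep = np.sort(np.argsort(scores)[::-1][:k])
    return weight[keep], keep

# Hypothetical usage: prune half the output dimensions of a 4x3 projection.
w = np.arange(12, dtype=float).reshape(4, 3)
s = np.array([0.9, 0.1, 0.5, 0.05])  # importance scores per output dimension
pruned_w, kept = prune_linear_dims(w, s, keep_ratio=0.5)
```

After pruning, the reduced matrices are fine-tuned (step 3) to recover any accuracy lost by discarding low-importance dimensions.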