稀疏ViT：重访激活稀疏性以实现高效高分辨率Vision Transformer (SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer) - 专知论文

会员服务 ·

0

稀疏 · 稀疏性 · 高分辨率 · 高分辨 · SparseViT ·

2023 年 3 月 30 日

SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer

翻译：稀疏ViT：重访激活稀疏性以实现高效高分辨率Vision Transformer

Xuanyao Chen,Zhijian Liu,Haotian Tang,Li Yi,Hang Zhao,Song Han

from arxiv, CVPR 2023. The first two authors contributed equally to this work. Project page: https://sparsevit.mit.edu

High-resolution images enable neural networks to learn richer visual representations. However, this improved performance comes at the cost of growing computational complexity, hindering their usage in latency-sensitive applications. As not all pixels are equal, skipping computations for less-important regions offers a simple and effective measure to reduce the computation. This, however, is hard to be translated into actual speedup for CNNs since it breaks the regularity of the dense convolution workload. In this paper, we introduce SparseViT that revisits activation sparsity for recent window-based vision transformers (ViTs). As window attentions are naturally batched over blocks, actual speedup with window activation pruning becomes possible: i.e., ~50% latency reduction with 60% sparsity. Different layers should be assigned with different pruning ratios due to their diverse sensitivities and computational costs. We introduce sparsity-aware adaptation and apply the evolutionary search to efficiently find the optimal layerwise sparsity configuration within the vast search space. SparseViT achieves speedups of 1.5x, 1.4x, and 1.3x compared to its dense counterpart in monocular 3D object detection, 2D instance segmentation, and 2D semantic segmentation, respectively, with negligible to no loss of accuracy.

翻译：高分辨率图像可以使神经网络学习到更丰富的视觉表示。然而，这种改进的性能是以不断增长的计算复杂度为代价的，限制了它们在延迟敏感型应用中的使用。由于并非所有像素都是相等的，因此跳过对不重要区域的计算可以提供一种简单有效的措施来减少计算。然而，由于这破坏了密集卷积负载的规律性，因此很难将其转化为CNN的实际加速。在本文中，我们引入了SparseViT，该算法重访了最近的基于窗口的Vision Transformer（ViTs）中的激活稀疏性。由于窗口注意力自然地被批处理为块，因此使用窗口激活剪枝可以实现实际的加速：即通过不到60％的稀疏性实现约50％的延迟降低。由于各层的敏感度和计算成本不同，因此应为不同的层分配不同的修剪比例。我们引入了稀疏感知适应和应用进化搜索，在广阔的搜索空间中有效地找到了最佳的层稀疏配置。SparseViT在单目三维物体检测，二维实例分割和二维语义分割中与其密集对应方法相比实现了1.5x，1.4x和1.3x的加速，几乎没有损失精度。

2

相关内容

【CVPR2023】SparseViT:重新审视高效高分辨率视觉Transformer的激活稀疏性

【CVPR2023】SparseViT:重新审视高效高分辨率视觉Transformer的激活稀疏性

专知会员服务

15+阅读 · 2023年4月2日

【CVPR2022】基于知识蒸馏的高效预训练

【CVPR2022】基于知识蒸馏的高效预训练

专知会员服务

32+阅读 · 2022年4月23日

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

【Google】高效Transformer综述，Efficient Transformers: A Survey

【Google】高效Transformer综述，Efficient Transformers: A Survey

专知会员服务

66+阅读 · 2022年3月17日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

【Google】最新《高效Transformers》综述大全，Efficient Transformers: A Survey

【Google】最新《高效Transformers》综述大全，Efficient Transformers: A Survey

专知会员服务

113+阅读 · 2020年9月17日

可解释高效异构图卷积网络，Interpretable and Efficient Heterogeneous Graph Convolutional Network

可解释高效异构图卷积网络，Interpretable and Efficient Heterogeneous Graph Convolutional Network

专知会员服务

63+阅读 · 2020年7月12日

Google AI博客解读论文《Reformer: The Efficient Transformer》，百万量级注意力机制

Google AI博客解读论文《Reformer: The Efficient Transformer》，百万量级注意力机制

专知会员服务

70+阅读 · 2020年1月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Deep Compression/Acceleration：模型压缩加速论文汇总

Deep Compression/Acceleration：模型压缩加速论文汇总

极市平台

14+阅读 · 2019年5月15日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文推荐】最新六篇主题模型相关论文—动态主题模型、主题趋势、大规模并行采样、随机采样、非参主题建模

【论文推荐】最新六篇主题模型相关论文—动态主题模型、主题趋势、大规模并行采样、随机采样、非参主题建模

专知

14+阅读 · 2018年6月24日

【论文推荐】最新六篇对抗自编码器相关论文—多尺度网络节点表示、生成对抗自编码、逆映射、Wasserstein、条件对抗、去噪

【论文推荐】最新六篇对抗自编码器相关论文—多尺度网络节点表示、生成对抗自编码、逆映射、Wasserstein、条件对抗、去噪

专知

20+阅读 · 2018年4月7日

【论文推荐】最新5篇图像分割相关论文—条件随机场和深度特征学习、移动端网络、长期视觉定位、主动学习、主动轮廓模型、生成对抗性网络

【论文推荐】最新5篇图像分割相关论文—条件随机场和深度特征学习、移动端网络、长期视觉定位、主动学习、主动轮廓模型、生成对抗性网络

专知

13+阅读 · 2018年1月23日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

【推荐】深度学习目标检测全面综述

【推荐】深度学习目标检测全面综述

机器学习研究会

21+阅读 · 2017年9月13日

压缩感知与稀疏信号恢复

国家自然科学基金

2+阅读 · 2014年12月31日

水声传感器网络的高效时间同步与定位

国家自然科学基金

0+阅读 · 2013年12月31日

基于超稀疏结构学习的压缩感知重建研究

国家自然科学基金

5+阅读 · 2013年12月31日

SAR和可见光图像的脉冲耦合神经网络分层感知融合研究

国家自然科学基金

0+阅读 · 2013年12月31日

压缩感知域高光谱数据高效压缩方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于低密度奇偶校验码的压缩感知系统设计与实现

国家自然科学基金

0+阅读 · 2012年12月31日

基于自适应压缩感知的地震信号稀疏表示与高效重构

国家自然科学基金

0+阅读 · 2012年12月31日

波长交错高采样率高精度光电模数转换器的研究

国家自然科学基金

0+阅读 · 2012年12月31日

宇宙暗物质和弱引力透镜功率谱的信息量研究

国家自然科学基金

0+阅读 · 2011年12月31日

压缩感知中采样与重建的理论及算法研究

国家自然科学基金

0+阅读 · 2011年12月31日

Tune-Mode ConvBN Blocks For Efficient Transfer Learning

Arxiv

0+阅读 · 2023年5月19日

Efficient Mixed Transformer for Single Image Super-Resolution

Arxiv

0+阅读 · 2023年5月19日

T-former: An Efficient Transformer for Image Inpainting

Arxiv

0+阅读 · 2023年5月19日

Deep Temporal Graph Clustering

Arxiv

0+阅读 · 2023年5月18日

CageViT: Convolutional Activation Guided Efficient Vision Transformer

Arxiv

0+阅读 · 2023年5月17日

Beyond Just Vision: A Review on Self-Supervised Representation Learning on Multimodal and Temporal Data

Arxiv

28+阅读 · 2022年6月8日

A Comprehensive Survey and Performance Analysis of Activation Functions in Deep Learning

A Comprehensive Survey and Performance Analysis of Activation Functions in Deep Learning

Arxiv

23+阅读 · 2021年9月29日

SVT-Net: Super Light-Weight Sparse Voxel Transformer for Large Scale Place Recognition

Arxiv

12+阅读 · 2021年5月30日

Sparsity in Deep Learning: Pruning and growth for efficient inference and training in neural networks

Arxiv

14+阅读 · 2021年1月31日

Efficient Transformers: A Survey

Arxiv

23+阅读 · 2020年9月16日

VIP会员

文章信息

相关主题

相关VIP内容

【CVPR2023】SparseViT:重新审视高效高分辨率视觉Transformer的激活稀疏性

【CVPR2023】SparseViT:重新审视高效高分辨率视觉Transformer的激活稀疏性

专知会员服务

15+阅读 · 2023年4月2日

【CVPR2022】基于知识蒸馏的高效预训练

【CVPR2022】基于知识蒸馏的高效预训练

专知会员服务

32+阅读 · 2022年4月23日

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

【Google】高效Transformer综述，Efficient Transformers: A Survey

【Google】高效Transformer综述，Efficient Transformers: A Survey

专知会员服务

66+阅读 · 2022年3月17日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

【Google】最新《高效Transformers》综述大全，Efficient Transformers: A Survey

【Google】最新《高效Transformers》综述大全，Efficient Transformers: A Survey

专知会员服务

113+阅读 · 2020年9月17日

可解释高效异构图卷积网络，Interpretable and Efficient Heterogeneous Graph Convolutional Network

可解释高效异构图卷积网络，Interpretable and Efficient Heterogeneous Graph Convolutional Network

专知会员服务

63+阅读 · 2020年7月12日

Google AI博客解读论文《Reformer: The Efficient Transformer》，百万量级注意力机制

Google AI博客解读论文《Reformer: The Efficient Transformer》，百万量级注意力机制

专知会员服务

70+阅读 · 2020年1月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

操作系统智能体：基于多模态大模型（MLLM）的通用计算设备智能体综述

《美国太空军系统全生命周期建模、仿真与分析效能提升方案》最新84页报告

【博士论文】推进数据高效的深度学习：非参数 Transformer、主动测试与上下文学习

自主人工智能：未来战争是否将是自主化的？

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Deep Compression/Acceleration：模型压缩加速论文汇总

Deep Compression/Acceleration：模型压缩加速论文汇总

极市平台

14+阅读 · 2019年5月15日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文推荐】最新六篇主题模型相关论文—动态主题模型、主题趋势、大规模并行采样、随机采样、非参主题建模

【论文推荐】最新六篇主题模型相关论文—动态主题模型、主题趋势、大规模并行采样、随机采样、非参主题建模

专知

14+阅读 · 2018年6月24日

【论文推荐】最新六篇对抗自编码器相关论文—多尺度网络节点表示、生成对抗自编码、逆映射、Wasserstein、条件对抗、去噪

【论文推荐】最新六篇对抗自编码器相关论文—多尺度网络节点表示、生成对抗自编码、逆映射、Wasserstein、条件对抗、去噪

专知

20+阅读 · 2018年4月7日

【论文推荐】最新5篇图像分割相关论文—条件随机场和深度特征学习、移动端网络、长期视觉定位、主动学习、主动轮廓模型、生成对抗性网络

【论文推荐】最新5篇图像分割相关论文—条件随机场和深度特征学习、移动端网络、长期视觉定位、主动学习、主动轮廓模型、生成对抗性网络

专知

13+阅读 · 2018年1月23日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

【推荐】深度学习目标检测全面综述

【推荐】深度学习目标检测全面综述

机器学习研究会

21+阅读 · 2017年9月13日

相关论文

Tune-Mode ConvBN Blocks For Efficient Transfer Learning

Arxiv

0+阅读 · 2023年5月19日

Efficient Mixed Transformer for Single Image Super-Resolution

Arxiv

0+阅读 · 2023年5月19日

T-former: An Efficient Transformer for Image Inpainting

Arxiv

0+阅读 · 2023年5月19日

Deep Temporal Graph Clustering

Arxiv

0+阅读 · 2023年5月18日

CageViT: Convolutional Activation Guided Efficient Vision Transformer

Arxiv

0+阅读 · 2023年5月17日

Beyond Just Vision: A Review on Self-Supervised Representation Learning on Multimodal and Temporal Data

Arxiv

28+阅读 · 2022年6月8日

A Comprehensive Survey and Performance Analysis of Activation Functions in Deep Learning

A Comprehensive Survey and Performance Analysis of Activation Functions in Deep Learning

Arxiv

23+阅读 · 2021年9月29日

SVT-Net: Super Light-Weight Sparse Voxel Transformer for Large Scale Place Recognition

Arxiv

12+阅读 · 2021年5月30日

Sparsity in Deep Learning: Pruning and growth for efficient inference and training in neural networks

Arxiv

14+阅读 · 2021年1月31日

Efficient Transformers: A Survey

Arxiv

23+阅读 · 2020年9月16日

相关基金

压缩感知与稀疏信号恢复

国家自然科学基金

2+阅读 · 2014年12月31日

水声传感器网络的高效时间同步与定位

国家自然科学基金

0+阅读 · 2013年12月31日

基于超稀疏结构学习的压缩感知重建研究

国家自然科学基金

5+阅读 · 2013年12月31日

SAR和可见光图像的脉冲耦合神经网络分层感知融合研究

国家自然科学基金

0+阅读 · 2013年12月31日

压缩感知域高光谱数据高效压缩方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于低密度奇偶校验码的压缩感知系统设计与实现

国家自然科学基金

0+阅读 · 2012年12月31日

基于自适应压缩感知的地震信号稀疏表示与高效重构

国家自然科学基金

0+阅读 · 2012年12月31日

波长交错高采样率高精度光电模数转换器的研究

国家自然科学基金

0+阅读 · 2012年12月31日

宇宙暗物质和弱引力透镜功率谱的信息量研究

国家自然科学基金

0+阅读 · 2011年12月31日

压缩感知中采样与重建的理论及算法研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员