S2 引擎: 用于无序进化神经网络的新元元元元元结构 (S2Engine: A Novel Systolic Architecture for Sparse Convolutional Neural Networks) - 专知论文

会员服务 ·

0

特化 · Neural Networks · 卷积 · Networking · 卷积神经网络 ·

2021 年 6 月 15 日

S2Engine: A Novel Systolic Architecture for Sparse Convolutional Neural Networks

翻译：S2 引擎: 用于无序进化神经网络的新元元元元元结构

Jianlei Yang,Wenzhi Fu,Xingzhou Cheng,Xucheng Ye,Pengcheng Dai,Weisheng Zhao

from arxiv, 13 pages, 17 figures

Convolutional neural networks (CNNs) have achieved great success in performing cognitive tasks. However, execution of CNNs requires a large amount of computing resources and generates heavy memory traffic, which imposes a severe challenge on computing system design. Through optimizing parallel executions and data reuse in convolution, systolic architecture demonstrates great advantages in accelerating CNN computations. However, regular internal data transmission path in traditional systolic architecture prevents the systolic architecture from completely leveraging the benefits introduced by neural network sparsity. Deployment of fine-grained sparsity on the existing systolic architectures is greatly hindered by the incurred computational overheads. In this work, we propose S2Engine $-$ a novel systolic architecture that can fully exploit the sparsity in CNNs with maximized data reuse. S2Engine transmits compressed data internally and allows each processing element to dynamically select an aligned data from the compressed dataflow in convolution. Compared to the naive systolic array, S2Engine achieves about $3.2\times$ and about $3.0\times$ improvements on speed and energy efficiency, respectively.

翻译：然而,执行CNN需要大量计算资源,并产生大量的记忆流量,这对计算系统设计构成严重挑战。通过优化同步处决和数据再利用,静态架构在加速CNN计算方面显示出巨大的优势。然而,传统静态架构中的常规内部数据传输路径阻止了静态架构完全利用神经网络孔径效应带来的惠益。对现有静态结构部署细微的静默性受到计算间接费用的极大阻碍。在这项工作中,我们提议S2Engine $-$,这是一个新型的静态架构,可以充分利用CNN的松散性,最大限度地重复使用数据。S2Engine 内部传输压缩数据,允许每个处理元素动态地从电流中选择压缩数据流流流的匹配数据。与天性静脉阵列相比,S2Engine在速度和能源效率上分别实现了约3.2美元和约3.0美元。

0

相关内容

【NeurIPS2020】点针图网络，Pointer Graph Networks

【NeurIPS2020】点针图网络，Pointer Graph Networks

专知会员服务

40+阅读 · 2020年9月27日

【ICLR-2020】网络反卷积，NETWORK DECONVOLUTION

【ICLR-2020】网络反卷积，NETWORK DECONVOLUTION

专知会员服务

39+阅读 · 2020年2月21日

深度卷积神经网络的最新架构综述，A Survey of the Recent Architectures of Deep Convolutional Neural Networks

深度卷积神经网络的最新架构综述，A Survey of the Recent Architectures of Deep Convolutional Neural Networks

专知会员服务

48+阅读 · 2020年2月15日

【新书】高级应用深度学习，卷积神经网络和目标检测（Advanced Applied Deep Learning ，Convolutional Neural Networks and Object Detection），附294页pdf

【新书】高级应用深度学习，卷积神经网络和目标检测（Advanced Applied Deep Learning ，Convolutional Neural Networks and Object Detection），附294页pdf

专知会员服务

95+阅读 · 2020年1月9日

【论文|Google】基于元学习的排序架构，Ranking architectures using meta-learning

【论文|Google】基于元学习的排序架构，Ranking architectures using meta-learning

专知会员服务

18+阅读 · 2019年11月30日

【Google】神经架构搜索（Neural Architecture Search and Beyond），Barret Zoph

【Google】神经架构搜索（Neural Architecture Search and Beyond），Barret Zoph

专知会员服务

31+阅读 · 2019年11月25日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

Yoshua Bengio，使算法知道“为什么”

Yoshua Bengio，使算法知道“为什么”

专知会员服务

8+阅读 · 2019年10月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Ray RLlib: Scalable 降龙十八掌

Ray RLlib: Scalable 降龙十八掌

CreateAMind

9+阅读 · 2018年12月28日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【CNN】一文读懂卷积神经网络CNN

【CNN】一文读懂卷积神经网络CNN

产业智能官

18+阅读 · 2018年1月2日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

【推荐】TensorFlow手把手CNN实践指南

【推荐】TensorFlow手把手CNN实践指南

机器学习研究会

5+阅读 · 2017年8月17日

Interpolation-Aware Padding for 3D Sparse Convolutional Neural Networks

Arxiv

0+阅读 · 2021年8月16日

CONet: Channel Optimization for Convolutional Neural Networks

Arxiv

0+阅读 · 2021年8月15日

EEEA-Net: An Early Exit Evolutionary Neural Architecture Search

Arxiv

0+阅读 · 2021年8月13日

AdaXpert: Adapting Neural Architecture for Growing Data

Arxiv

4+阅读 · 2021年7月1日

Neural Architecture Generator Optimization

Arxiv

6+阅读 · 2020年10月8日

Auto-GNN: Neural Architecture Search of Graph Neural Networks

Auto-GNN: Neural Architecture Search of Graph Neural Networks

Arxiv

7+阅读 · 2019年9月10日

Reverse Attention for Salient Object Detection

Arxiv

11+阅读 · 2019年4月15日

Neural Architecture Optimization

Neural Architecture Optimization

Arxiv

8+阅读 · 2018年9月5日

Tiny SSD: A Tiny Single-shot Detection Deep Convolutional Neural Network for Real-time Embedded Object Detection

Arxiv

7+阅读 · 2018年2月19日

Pointer Networks

Arxiv

4+阅读 · 2017年1月2日

VIP会员

文章信息

相关主题

Neural Networks

卷积神经网络

相关VIP内容

【NeurIPS2020】点针图网络，Pointer Graph Networks

【NeurIPS2020】点针图网络，Pointer Graph Networks

专知会员服务

40+阅读 · 2020年9月27日

【ICLR-2020】网络反卷积，NETWORK DECONVOLUTION

【ICLR-2020】网络反卷积，NETWORK DECONVOLUTION

专知会员服务

39+阅读 · 2020年2月21日

深度卷积神经网络的最新架构综述，A Survey of the Recent Architectures of Deep Convolutional Neural Networks

深度卷积神经网络的最新架构综述，A Survey of the Recent Architectures of Deep Convolutional Neural Networks

专知会员服务

48+阅读 · 2020年2月15日

【新书】高级应用深度学习，卷积神经网络和目标检测（Advanced Applied Deep Learning ，Convolutional Neural Networks and Object Detection），附294页pdf

【新书】高级应用深度学习，卷积神经网络和目标检测（Advanced Applied Deep Learning ，Convolutional Neural Networks and Object Detection），附294页pdf

专知会员服务

95+阅读 · 2020年1月9日

【论文|Google】基于元学习的排序架构，Ranking architectures using meta-learning

【论文|Google】基于元学习的排序架构，Ranking architectures using meta-learning

专知会员服务

18+阅读 · 2019年11月30日

【Google】神经架构搜索（Neural Architecture Search and Beyond），Barret Zoph

【Google】神经架构搜索（Neural Architecture Search and Beyond），Barret Zoph

专知会员服务

31+阅读 · 2019年11月25日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

Yoshua Bengio，使算法知道“为什么”

Yoshua Bengio，使算法知道“为什么”

专知会员服务

8+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

《军事域人工智能风险、机遇与治理战略指导报告》2025最新76页报告

《杀伤网与精确规模：智能饱和战争时代的战略要务-印度视角》2025最新报告

俄乌冲突的地缘政治与军事教训（万字长文）

《弹药快速效能建模：推进互操作性与技术优势》2025最新26页报告

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Ray RLlib: Scalable 降龙十八掌

Ray RLlib: Scalable 降龙十八掌

CreateAMind

9+阅读 · 2018年12月28日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【CNN】一文读懂卷积神经网络CNN

【CNN】一文读懂卷积神经网络CNN

产业智能官

18+阅读 · 2018年1月2日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

【推荐】TensorFlow手把手CNN实践指南

【推荐】TensorFlow手把手CNN实践指南

机器学习研究会

5+阅读 · 2017年8月17日

相关论文

Interpolation-Aware Padding for 3D Sparse Convolutional Neural Networks

Arxiv

0+阅读 · 2021年8月16日

CONet: Channel Optimization for Convolutional Neural Networks

Arxiv

0+阅读 · 2021年8月15日

EEEA-Net: An Early Exit Evolutionary Neural Architecture Search

Arxiv

0+阅读 · 2021年8月13日

AdaXpert: Adapting Neural Architecture for Growing Data

Arxiv

4+阅读 · 2021年7月1日

Neural Architecture Generator Optimization

Arxiv

6+阅读 · 2020年10月8日

Auto-GNN: Neural Architecture Search of Graph Neural Networks

Auto-GNN: Neural Architecture Search of Graph Neural Networks

Arxiv

7+阅读 · 2019年9月10日

Reverse Attention for Salient Object Detection

Arxiv

11+阅读 · 2019年4月15日

Neural Architecture Optimization

Neural Architecture Optimization

Arxiv

8+阅读 · 2018年9月5日

Tiny SSD: A Tiny Single-shot Detection Deep Convolutional Neural Network for Real-time Embedded Object Detection

Arxiv

7+阅读 · 2018年2月19日

Pointer Networks

Arxiv

4+阅读 · 2017年1月2日

微信扫码咨询专知VIP会员