Deep neural network (DNN) models continue to grow in size and complexity, demanding higher computational power to enable real-time inference. To meet these computational demands efficiently, hardware accelerators are being developed and deployed across scales. This naturally requires an efficient scale-out mechanism for increasing compute density as required by the application. 2.5D integration over an interposer has emerged as a promising solution, but as we show in this work, the limited interposer bandwidth and multiple hops in the Network-on-Package (NoP) can diminish the benefits of the approach. To cope with this challenge, we propose WIENNA, a wireless NoP-based 2.5D DNN accelerator. In WIENNA, the wireless NoP connects an array of DNN accelerator chiplets to the global buffer chiplet, providing high-bandwidth multicasting capabilities. Here, we also identify the dataflow style that most efficiently exploits the wireless NoP's high-bandwidth multicasting capability on each layer. With modest area and power overheads, WIENNA achieves 2.2X--5.1X higher throughput and 38.2% lower energy than an interposer-based NoP design.