Reconfigurable accelerators for deep neural networks (DNNs) promise to improve performance metrics such as inference latency. STONNE is the first cycle-accurate simulator for reconfigurable DNN inference accelerators, which allows exploration of the accelerator design and configuration space. However, preparing models for evaluation and exploring the configuration space in STONNE is a manual, time-consuming process, which is a barrier to research. This paper introduces Bifrost, an end-to-end framework for the evaluation and optimization of reconfigurable DNN inference accelerators. Bifrost operates as a frontend for STONNE and leverages the TVM deep learning compiler stack to parse models and automate the offloading of accelerated computations. We discuss Bifrost's advantages over STONNE and other tools, and evaluate the MAERI and SIGMA architectures using Bifrost. Additionally, Bifrost introduces a module leveraging AutoTVM to efficiently explore accelerator designs and the dataflow mapping space to optimize performance. This is demonstrated by tuning the MAERI architecture and generating efficient dataflow mappings for AlexNet, obtaining an average speedup of $50\times$ for the convolutional layers and $11\times$ for the fully connected layers. Our code is available at www.github.com/gicLAB/bifrost.
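To make the TVM-based workflow concrete, the following is a minimal sketch of how a Bifrost-style frontend could be driven: a model is parsed into Relay and compiled, with supported operators offloaded to the simulated accelerator. Only the TVM/Relay and ONNX calls shown are standard APIs; the `bifrost` import, its `set_architecture` call, and the file names are assumptions for illustration, not the framework's actual interface.

```python
# Hedged sketch: compile a model with TVM/Relay; a Bifrost-style frontend
# would intercept convolution/dense operators and run them on STONNE.
import onnx
import tvm
from tvm import relay

# import bifrost                      # hypothetical: registers the STONNE offload library
# bifrost.set_architecture("maeri")   # hypothetical: select the accelerator configuration

# Load a pretrained model and convert it to Relay IR.
onnx_model = onnx.load("alexnet.onnx")            # assumed model file
shape_dict = {"data": (1, 3, 224, 224)}           # assumed input name/shape
mod, params = relay.frontend.from_onnx(onnx_model, shape_dict)

# Build for a CPU target; with the offload library registered, accelerated
# operators would be executed on the simulator instead of natively.
with tvm.transform.PassContext(opt_level=3):
    lib = relay.build(mod, target="llvm", params=params)
```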