f-CNN$ ⁇ text{x}$: 绘制多CNN在FPGAs上的应用工具流 (f-CNN$^{\text{x}}$: A Toolflow for Mapping Multi-CNN Applications on FPGAs)

The predictive power of Convolutional Neural Networks (CNNs) has been an integral factor for emerging latency-sensitive applications, such as autonomous drones and vehicles. Such systems employ multiple CNNs, each one trained for a particular task. The efficient mapping of multiple CNNs on a single FPGA device is a challenging task as the allocation of compute resources and external memory bandwidth needs to be optimised at design time. This paper proposes f-CNN$^{\text{x}}$, an automated toolflow for the optimised mapping of multiple CNNs on FPGAs, comprising a novel multi-CNN hardware architecture together with an automated design space exploration method that considers the user-specified performance requirements for each model to allocate compute resources and generate a synthesisable accelerator. Moreover, f-CNN$^{\text{x}}$ employs a novel scheduling algorithm that alleviates the limitations of the memory bandwidth contention between CNNs and sustains the high utilisation of the architecture. Experimental evaluation shows that f-CNN$^{\text{x}}$'s designs outperform contention-unaware FPGA mappings by up to 50% and deliver up to 6.8x higher performance-per-Watt over highly optimised GPU designs for multi-CNN systems.

翻译：革命神经网络(CNNs)的预测力是新兴隐性敏感应用(如自主无人驾驶飞机和车辆)的一个不可或缺的因素。这种系统使用多个CNN, 每一个都经过特定任务培训。在单一的FPGA设备上对多个CNN进行高效绘图是一项具有挑战性的任务,因为计算资源和外部记忆带宽的分配需要在设计时加以优化。本文提议在FPGAs上对多个CNN进行优化绘图的自动工具流f- CN$ text{x ⁇ $, 包括一个新型的多CNN硬件结构,加上一个自动化设计空间探索方法,该方法考虑到每个模型的用户指定性能要求,以分配计算资源并生成一个可合成的加速器。此外, f- CN${text{x$x$在设计时需要优化计算资源和外部记忆带宽度带宽度带宽度的配置。本文提议在FPG- PA系统上保持高利用率。实验性能评估显示,f-CN$N$NN$NNN$硬件设计超越高端的GFPA系统, 将GPAFAS- 2012- profrofard- proformadestrax

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

哈工大SCIR 14篇长文被ACL 2021主会/Findings和IJCAI 2021录用

专知会员服务

56+阅读 · 2021年5月10日

深度神经网络模型压缩综述

专知会员服务

116+阅读 · 2020年8月22日

基于动态时空图CNNs的交通流预测，Dynamic Spatio-temporal Graph-based CNNs for Traffic Flow Prediction

专知会员服务

136+阅读 · 2020年3月8日

自动驾驶汽车的协调:分类和调查综述（Coordination of Autonomous Vehicles: Taxonomy and Survey），附31页pdf

专知会员服务

14+阅读 · 2020年1月9日