FastDeepIoT:努力了解和优化移动和嵌入装置的神经网络执行时间 (FastDeepIoT: Towards Understanding and Optimizing Neural Network Execution Time on Mobile and Embedded Devices)

Deep neural networks show great potential as solutions to many sensing application problems, but their excessive resource demand slows down execution time, pausing a serious impediment to deployment on low-end devices. To address this challenge, recent literature focused on compressing neural network size to improve performance. We show that changing neural network size does not proportionally affect performance attributes of interest, such as execution time. Rather, extreme run-time nonlinearities exist over the network configuration space. Hence, we propose a novel framework, called FastDeepIoT, that uncovers the non-linear relation between neural network structure and execution time, then exploits that understanding to find network configurations that significantly improve the trade-off between execution time and accuracy on mobile and embedded devices. FastDeepIoT makes two key contributions. First, FastDeepIoT automatically learns an accurate and highly interpretable execution time model for deep neural networks on the target device. This is done without prior knowledge of either the hardware specifications or the detailed implementation of the used deep learning library. Second, FastDeepIoT informs a compression algorithm how to minimize execution time on the profiled device without impacting accuracy. We evaluate FastDeepIoT using three different sensing-related tasks on two mobile devices: Nexus 5 and Galaxy Nexus. FastDeepIoT further reduces the neural network execution time by $48\%$ to $78\%$ and energy consumption by $37\%$ to $69\%$ compared with the state-of-the-art compression algorithms.

翻译：深心神经网络显示作为许多感应应用问题解决方案的巨大潜力, 但是它们过度的资源需求会拖慢执行时间, 严重妨碍低端装置的部署。为了应对这一挑战, 最近的一些文献侧重于压缩神经网络规模以改善性能。我们显示, 变化的神经网络规模不会按比例影响感兴趣的性能属性, 如执行时间。相反, 网络配置空间上存在极端运行时的非线性性。因此, 我们提议了一个新框架, 称为 FastDeepIoT, 揭示神经网络结构与执行时间之间的非线性关系, 然后利用这一理解来寻找能够大大改善执行时间与移动和嵌入装置的精确度之间的交易的网络配置。 Fast DeepIoT 做出两项关键贡献。首先, FastDeepIoT 自动学习一个精确和高度可解释的深度神经网络运行时间模型。这样做既没有了解硬件规格,也没有详细实施旧的深层学习图书馆。其次, FastDeepIT 将一个压缩时间算法, 与快速智能 N- drealalalalalalal- dealalalalalal comduction comduction 5 Salalalalalal complactal 工作对比如何在快速定位设备上, 3 Stal- calal- calizalizalalal- salizalalalal imal imal imal imational imationalizal imational imational 上如何将如何在使用两个与快速定位设备上, 。

相关内容

Neural Networks

关注 1648

神经网络（Neural Networks）是世界上三个最古老的神经建模学会的档案期刊:国际神经网络学会(INNS)、欧洲神经网络学会(ENNS)和日本神经网络学会(JNNS)。神经网络提供了一个论坛，以发展和培育一个国际社会的学者和实践者感兴趣的所有方面的神经网络和相关方法的计算智能。神经网络欢迎高质量论文的提交，有助于全面的神经网络研究，从行为和大脑建模，学习算法，通过数学和计算分析，系统的工程和技术应用，大量使用神经网络的概念和技术。这一独特而广泛的范围促进了生物和技术研究之间的思想交流，并有助于促进对生物启发的计算智能感兴趣的跨学科社区的发展。因此，神经网络编委会代表的专家领域包括心理学，神经生物学，计算机科学，工程，数学，物理。该杂志发表文章、信件和评论以及给编辑的信件、社论、时事、软件调查和专利信息。文章发表在五个部分之一:认知科学，神经科学，学习系统，数学和计算分析、工程和应用。官网地址：http://dblp.uni-trier.de/db/journals/nn/

【ICML2020】深度神经网络置信感知学习，Conﬁdence-Aware Learning for Deep Neural Networks

专知会员服务

74+阅读 · 2020年7月6日

【SIGIR2020】多检索系统的贝叶斯推理风险评估，Bayesian Inferential Risk Evaluation On Multiple IR Systems

专知会员服务

9+阅读 · 2020年6月10日

因果图，Causal Graphs，52页ppt

专知会员服务

250+阅读 · 2020年4月19日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日