Deep neural networks show great potential as solutions to many sensing application problems, but their excessive resource demand slows down execution time, pausing a serious impediment to deployment on low-end devices. To address this challenge, recent literature focused on compressing neural network size to improve performance. We show that changing neural network size does not proportionally affect performance attributes of interest, such as execution time. Rather, extreme run-time nonlinearities exist over the network configuration space. Hence, we propose a novel framework, called FastDeepIoT, that uncovers the non-linear relation between neural network structure and execution time, then exploits that understanding to find network configurations that significantly improve the trade-off between execution time and accuracy on mobile and embedded devices. FastDeepIoT makes two key contributions. First, FastDeepIoT automatically learns an accurate and highly interpretable execution time model for deep neural networks on the target device. This is done without prior knowledge of either the hardware specifications or the detailed implementation of the used deep learning library. Second, FastDeepIoT informs a compression algorithm how to minimize execution time on the profiled device without impacting accuracy. We evaluate FastDeepIoT using three different sensing-related tasks on two mobile devices: Nexus 5 and Galaxy Nexus. FastDeepIoT further reduces the neural network execution time by $48\%$ to $78\%$ and energy consumption by $37\%$ to $69\%$ compared with the state-of-the-art compression algorithms.
翻译:深心神经网络显示作为许多感应应用问题解决方案的巨大潜力, 但是它们过度的资源需求会拖慢执行时间, 严重妨碍低端装置的部署。 为了应对这一挑战, 最近的一些文献侧重于压缩神经网络规模以改善性能。 我们显示, 变化的神经网络规模不会按比例影响感兴趣的性能属性, 如执行时间。 相反, 网络配置空间上存在极端运行时的非线性性。 因此, 我们提议了一个新框架, 称为 FastDeepIoT, 揭示神经网络结构与执行时间之间的非线性关系, 然后利用这一理解来寻找能够大大改善执行时间与移动和嵌入装置的精确度之间的交易的网络配置。 Fast DeepIoT 做出两项关键贡献。 首先, FastDeepIoT 自动学习一个精确和高度可解释的深度神经网络运行时间模型。 这样做既没有了解硬件规格,也没有详细实施旧的深层学习图书馆。 其次, FastDeepIT 将一个压缩时间算法, 与快速智能 N- drealalalalalalal- dealalalalalal comduction comduction 5 Salalalalalal complactal 工作对比如何在快速定位设备上, 3 Stal- calal- calizalizalalal- salizalalalal imal imal imal imational imationalizal imational imational 上如何将如何在使用两个与快速定位设备上, 。