Energy harvesting (EH) IoT devices that operate intermittently without batteries, coupled with advances in deep neural networks (DNNs), have opened up new opportunities for enabling sustainable smart applications. Nevertheless, implementing those computation and memory-intensive intelligent algorithms on EH devices is extremely difficult due to the challenges of limited resources and intermittent power supply that causes frequent failures. To address those challenges, this paper proposes a methodology that enables super-fast deep learning with low-energy accelerators for tiny energy harvesting devices. We first propose RAD, a resource-aware structured DNN training framework, which employs block circulant matrix with ADMM to achieve high compression and model quantization for leveraging the advantage of various vector operation accelerators. A DNN implementation method, ACE, is then proposed that employs low-energy accelerators to profit maximum performance with minor energy consumption. Finally, we further design FLEX, the system support for intermittent computation in energy harvesting situations. Experimental results from three different DNN models demonstrate that RAD, ACE, and FLEX can enable super-fast and correct inference on energy harvesting devices with up to 4.26X runtime reduction, up to 7.7X energy reduction with higher accuracy over the state-of-the-art.
翻译:然而,由于资源有限和断断续续的电力供应造成经常失败的挑战,实施这些计算和记忆密集的智能算法极为困难。为了应对这些挑战,本文件提出了一种方法,使利用低能加速器进行低能加速器进行小型能源收获装置的超快深层次学习。我们首先建议了一种资源智能结构结构的DNN培训框架RAD,即与ADMM公司使用块状螺旋体矩阵实现高压缩和模型量化,以利用各种矢量加速器的优势。然后建议采用DNNE实施方法,即ACE,采用低能加速器,以最大限度地提高能源消耗的性能。最后,我们进一步设计FLEX,在能源收获情况下支持间歇性计算。三个不同的DNNM模型的实验结果显示,RAD、ACE和FLEX能够使AD、ACE和FLEX能够实现高压压和正确度量化,从而降低X级能源采集的精确度,直至4x级降低能源采集的精确度。