Energy harvesting (EH) IoT devices that operate intermittently without batteries, coupled with advances in deep neural networks (DNNs), have opened up new opportunities for enabling sustainable smart applications. Nevertheless, implementing those computation and memory-intensive intelligent algorithms on EH devices is extremely difficult due to the challenges of limited resources and intermittent power supply that causes frequent failures. To address those challenges, this paper proposes a methodology that enables fast deep learning with low-energy accelerators for tiny energy harvesting devices. We first propose $RAD$, a resource-aware structured DNN training framework, which employs block circulant matrix and structured pruning to achieve high compression for leveraging the advantage of various vector operation accelerators. A DNN implementation method, $ACE$, is then proposed that employs low-energy accelerators to profit maximum performance with small energy consumption. Finally, we further design $FLEX$, the system support for intermittent computation in energy harvesting situations. Experimental results from three different DNN models demonstrate that $RAD$, $ACE$, and $FLEX$ can enable fast and correct inference on energy harvesting devices with up to 4.26X runtime reduction, up to 7.7X energy reduction with higher accuracy over the state-of-the-art.
翻译:然而,由于资源有限和断断续续的电力供应造成经常失败的挑战,实施这些计算和记忆密集的智能算法极为困难。为了应对这些挑战,本文件建议了一种方法,使利用低能加速器快速深层次学习小节能收获装置的低能加速器。我们首先提议了一种资源智能结构化的DNN培训框架,即资源智能结构化DNN培训框架$RAD$,使用块状螺旋矩阵和结构化的剪接机实现高压,以利用各种矢量加速器的优势。然后提议采用DNNN执行方法,即$ACE$,采用低能加速器,以最大限度地利用小型能源消耗获得性能。最后,我们进一步设计了美元FLEX,系统支持在能源收获情况下进行间歇性计算。三个不同的DNNM模型的实验结果显示,用美元、美元和美元FLEX$可以快速和准确地降低各种矢量加速度。 26DNNNC将能源回收设备降低到4X。