Research on Deep Neural Networks (DNNs) has focused on improving performance and accuracy for real-world deployments, leading to new models, such as Spiking Neural Networks (SNNs), and optimization techniques, e.g., quantization and pruning for compressed networks. However, the deployment of these innovative models and optimization techniques introduces possible reliability issues, which is a pillar for DNNs to be widely used in safety-critical applications, e.g., autonomous driving. Moreover, scaling technology nodes have the associated risk of multiple faults happening at the same time, a possibility not addressed in state-of-the-art resiliency analyses. Towards better reliability analysis for DNNs, we present enpheeph, a Fault Injection Framework for Spiking and Compressed DNNs. The enpheeph framework enables optimized execution on specialized hardware devices, e.g., GPUs, while providing complete customizability to investigate different fault models, emulating various reliability constraints and use-cases. Hence, the faults can be executed on SNNs as well as compressed networks with minimal-to-none modifications to the underlying code, a feat that is not achievable by other state-of-the-art tools. To evaluate our enpheeph framework, we analyze the resiliency of different DNN and SNN models, with different compression techniques. By injecting a random and increasing number of faults, we show that DNNs can show a reduction in accuracy with a fault rate as low as 7 x 10 ^ (-7) faults per parameter, with an accuracy drop higher than 40%. Run-time overhead when executing enpheeph is less than 20% of the baseline execution time when executing 100 000 faults concurrently, at least 10x lower than state-of-the-art frameworks, making enpheeph future-proof for complex fault injection scenarios. We release enpheeph at https://github.com/Alexei95/enpheeph.
翻译:深神经网络(DNN)的研究侧重于提高实际世界部署的性能和准确性,导致新的模型,如Spiking神经网络(SNN),以及优化技术,例如对压缩网络的量化和修剪。然而,这些创新模型和优化技术的部署带来了可能的可靠性问题,这是DNN在安全关键应用程序(如自主驱动)中广泛使用的一个支柱。此外,技术节点具有同时发生多重故障的相关风险,这种风险在Spiking Neural网络(SNNNN)和优化技术(如SNNN)的可靠性分析中得不到解决。但是,这些创新模型和优化技术的部署为Spiking and Concompress DNNNNC提供最佳执行力,而用于调查不同错误模型、模拟各种可靠性限制和使用案例的完全自定义。因此,在SNNNFER的精确度分析中,在SNPO-deal-deal-deal-deal-deal-deal-deal-deal-deal-de-deal-deal-deal-de-de-deal-deal-de ruil-de-de-de ruil-de-de-deal-de-de-de-de-deal-deal-n-de-de-de-deisl)中,我们可以以10-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-defl-defal-de-s-de-s-s-s-de-de-def-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-s-s-de-de-de-de-de-de-de-de-de-defer-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de