Computing-in-memory with emerging non-volatile memory (nvCiM) is shown to be a promising candidate for accelerating deep neural networks (DNNs) with high energy efficiency. However, most non-volatile memory (NVM) devices suffer from reliability issues, resulting in a difference between actual data involved in the nvCiM computation and the weight value trained in the data center. Thus, models actually deployed on nvCiM platforms achieve lower accuracy than their counterparts trained on the conventional hardware (e.g., GPUs). In this chapter, we first offer a brief introduction to the opportunities and challenges of nvCiM DNN accelerators and then show the properties of different types of NVM devices. We then introduce the general architecture of nvCiM DNN accelerators. After that, we discuss the source of unreliability and how to efficiently model their impact. Finally, we introduce representative works that mitigate the impact of device variations.
翻译:与新兴非挥发性内存(nvCiM)的计算机内存储器被证明是加速高能效深神经网络(DNN)的一个有希望的候选对象,然而,大多数非挥发性内存(NVM)装置都存在可靠性问题,导致NvCiM计算中的实际数据与数据中心培训的重量值之间的差异。因此,在nvCiM平台上实际部署的模型的准确性低于在常规硬件(例如GPUs)上培训的对应方。在本章中,我们首先简要介绍了nvCiM DNNN加速器的机会和挑战,然后展示了不同类型NVM装置的特性。我们随后引入了nvCiM DNNNC加速器的一般结构。之后,我们讨论了不可靠性的来源以及如何有效地模拟其影响。最后,我们介绍了减轻设备变异影响的有代表性的工作。