Processing In Memory (PIM) accelerators are promising architecture that can provide massive parallelization and high efficiency in various applications. Such architectures can instantaneously provide ultra-fast operation over extensive data, allowing real-time performance in data-intensive workloads. For instance, Resistive Memory (ReRAM) based PIM architectures are widely known for their inherent dot-product computation capability. While the performance of such architecture is essential, reliability and accuracy are also important, especially in mission-critical real-time systems. Unfortunately, the PIM architectures have a fundamental limitation in guaranteeing error-free operation. As a result, current methods must pay high implementation costs or performance penalties to achieve reliable execution in the PIM accelerator. In this paper, we make a fundamental observation of this reliability limitation of ReRAM based PIM architecture. Accordingly, we propose a novel solution--Falut Tolerant PIM or FAT-PIM, that can improve reliability for such systems significantly at a low cost. Our evaluation shows that we can improve the error tolerance significantly with only 4.9% performance cost and 3.9% storage overhead.
翻译:记忆中处理加速器(PIM)是大有希望的架构,可以在各种应用中提供大规模平行和高效率。这些架构可以瞬间提供超快操作,超过广泛的数据,允许数据密集型工作量的实时性能。例如,基于保存内存(RERM)的PIM架构因其固有的点数计算能力而广为人知。虽然这种架构的性能至关重要,但可靠性和准确性也很重要,特别是在任务危急的实时系统中。不幸的是,PIM架构在保障无误操作方面有着根本性的局限性。因此,目前的方法必须支付高额执行成本或性能罚款,才能在PIM加速器中实现可靠的执行。在本文中,我们对基于ReRAM的PIM架构的可靠性限制进行了基本观察。因此,我们建议采用新型的解决方案-高压调PIM或FAT-PIM,这样可以以低廉的成本大幅提高这些系统的可靠性。我们的评估表明,我们可以用只有4.9%的性能成本和3.9%的存储间接费用大大改进错误容忍度。