Lifelong on-device learning is a key challenge for machine intelligence, and this requires learning from few, often single, samples. Memory augmented neural network has been proposed to achieve the goal, but the memory module has to be stored in an off-chip memory due to its size. Therefore the practical use has been heavily limited. Previous works on emerging memory-based implementation have difficulties in scaling up because different modules with various structures are difficult to integrate on the same chip and the small sense margin of the content addressable memory for the memory module heavily limited the degree of mismatch calculation. In this work, we implement the entire memory augmented neural network architecture in a fully integrated memristive crossbar platform and achieve an accuracy that closely matches standard software on digital hardware for the Omniglot dataset. The successful demonstration is supported by implementing new functions in crossbars in addition to widely reported matrix multiplications. For example, the locality-sensitive hashing operation is implemented in crossbar arrays by exploiting the intrinsic stochasticity of memristor devices. Besides, the content-addressable memory module is realized in crossbars, which also supports the degree of mismatches. Simulations based on experimentally validated models show such an implementation can be efficiently scaled up for one-shot learning on the Mini-ImageNet dataset. The successful demonstration paves the way for practical on-device lifelong learning and opens possibilities for novel attention-based algorithms not possible in conventional hardware.
翻译:终身在设备上学习是机器智能的关键挑战,这需要从为数不多的、往往是单个的样本中学习。为了实现这一目标,已经提议了内存增强神经网络,但内存模块因其大小而必须储存在离芯存储器的存储器中。因此,实际用途非常有限。以前关于新记忆执行的工程在扩大规模方面有困难,因为不同结构的不同模块难以整合在同一芯片上,而内存模块内容的可读存储空间很小,这严重限制了错配计算的程度。在这项工作中,我们将整个内存增强神经网络结构在完全整合的常规跨截线平台中实施,并实现与Omniglot数据集数字硬件标准软件密切匹配的准确性。成功演示得到了支持,除了广泛报道的矩阵多重影响之外,还实施了交叉栏中的新功能。例如,通过利用内存模块的内在可读性可读存储存储存储器的可读取性,在跨卡路中实现内容可读存储模块,同时支持在跨卡路路里进行试验性的测试性学习,可以对数据库进行某种程度进行快速的读取。