The design of many-core neuromorphic hardware is getting more and more complex as these systems are expected to execute large machine learning models. To deal with the design complexity, a predictable design flow is needed to guarantee real-time performance such as latency and throughput without significantly increasing the buffer requirement of computing cores. Synchronous Data Flow Graphs (SDFGs) are used for predictable mapping of streaming applications to multiprocessor systems. We propose an SDFG-based design flow for mapping spiking neural networks (SNNs) to many-core neuromorphic hardware with the objective of exploring the tradeoff between throughput and buffer size. The proposed design flow integrates an iterative partitioning approach, based on Kernighan-Lin graph partitioning heuristic, creating SNN clusters such that each cluster can be mapped to a core of the hardware. The partitioning approach minimizes the inter-cluster spike communication, which improves latency on the shared interconnect of the hardware. Next, the design flow maps clusters to cores using an instance of the Particle Swarm Optimization (PSO), an evolutionary algorithm, exploring the design space of throughput and buffer size. Pareto optimal mappings are retained from the design flow, allowing system designers to select a Pareto mapping that satisfies throughput and buffer size requirements of the design. We evaluated the design flow using five large-scale convolutional neural network (CNN) models. Results demonstrate 63% higher maximum throughput and 10% lower buffer size requirement compared to state-of-the-art dataflow-based mapping solutions.
翻译:许多核心神经变形硬件的设计越来越复杂,因为这些系统预计将执行大型机器学习模型。为了应对设计的复杂性,需要可预测的设计流程,以保证实时性能,如悬浮和吞吐量,而不会大幅增加计算核心的缓冲要求。同步数据流动图(SDFGs)用于对多处理器系统的流应用进行可预测的绘图。我们提议以SDFG为基础的设计流程,用于将神经网络(SNNS)与许多核心神经变异硬件进行绘图,目的是探索吞吐量和缓冲规模之间的交易。拟议的设计流程需要基于Kernighan-Lin图形偏移和吞吐量,从而保证实时分配方法,使每个组群能够被映射到多处理器系统的一个核心。我们提议了一个基于SDFG的设计流程,使基于内存的神经网络网络(SNNNN)更好地连接到核心,从而利用Peetrom 内流(PSO) 比较内流和缓冲系统(PADLIL) 的大小,一个通过最高级的编程和最高级的编程系统进行空间设计演算,从而将数据从PAdel-destrual-deal-de-deal-destrual-deal-destrisl) 将数据流到通过一个通过五级的系统进行空间设计到最精制的系统,将数据流到最深的系统进行空间设计流程。