Recent advances in Deep Neural Networks (DNNs) have led to active development of specialized DNN accelerators, many of which feature a large number of processing elements laid out spatially, together with a multi-level memory hierarchy and flexible interconnect. While DNN accelerators can take advantage of data reuse and achieve high peak throughput, they also expose a large number of runtime parameters to the programmers who need to explicitly manage how computation is scheduled both spatially and temporally. In fact, different scheduling choices can lead to wide variations in performance and efficiency, motivating the need for a fast and efficient search strategy to navigate the vast scheduling space. To address this challenge, we present CoSA, a constrained-optimization-based approach for scheduling DNN accelerators. As opposed to existing approaches that either rely on designers' heuristics or iterative methods to navigate the search space, CoSA expresses scheduling decisions as a constrained-optimization problem that can be deterministically solved using mathematical optimization techniques. Specifically, CoSA leverages the regularities in DNN operators and hardware to formulate the DNN scheduling space into a mixed-integer programming (MIP) problem with algorithmic and architectural constraints, which can be solved to automatically generate a highly efficient schedule in one shot. We demonstrate that CoSA-generated schedules significantly outperform state-of-the-art approaches by a geometric mean of up to 2.5x across a wide range of DNN networks while improving the time-to-solution by 90x.
翻译:深神经网络(DNN)最近的进展导致专门DNN的专用加速器(DNN)的积极发展,其中许多具有大量空间化的处理要素,以及多层次的内存等级和灵活的互连。虽然DNN加速器可以利用数据再利用和实现高峰输送量,但它们也暴露了程序员的大量运行时间参数,他们需要明确管理如何在空间和时间上安排计算。事实上,不同的时间安排选择可能导致性能和效率方面的广泛差异,从而促使需要有一个快速高效的搜索战略来浏览广阔的排程空间。为了应对这一挑战,我们介绍COSA,一种基于限制优化的办法来安排DNNNC加速器。相对于现有的方法,即依靠设计师的超自然理论或迭接力方法来导航搜索空间,CSA表示,时间安排决定是一种限制性的优化问题,可以通过数学优化技术来解决。具体地SA利用DNNNO操作员和硬件的常规化搜索策略来应对大量时间框架,同时将DNNNM-M-M-F-R-C-C-C-SAL-C-C-SAL-SAL-SL-C-SL-SL-SL-SL-SL-SL-SL-SL-SL-C-SIR-Sirentral-SL-SL-C-C-SL-C-C-SL-SL-C-C-C-C-SL-SL-SL-SL-C-SL-C-C-C-C-C-C-SL-C-C-SL-SL-SL-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-SL-C-C-SL-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-