思维图绘制:使能高效地测算仪-加速器绘制空间搜索图 (Mind Mappings: Enabling Efficient Algorithm-Accelerator Mapping Space Search)

from arxiv, Appears in the proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS '21), April 19-23, 2021, Virtual, USA

Modern day computing increasingly relies on specialization to satiate growing performance and efficiency requirements. A core challenge in designing such specialized hardware architectures is how to perform mapping space search, i.e., search for an optimal mapping from algorithm to hardware. Prior work shows that choosing an inefficient mapping can lead to multiplicative-factor efficiency overheads. Additionally, the search space is not only large but also non-convex and non-smooth, precluding advanced search techniques. As a result, previous works are forced to implement mapping space search using expert choices or sub-optimal search heuristics. This work proposes Mind Mappings, a novel gradient-based search method for algorithm-accelerator mapping space search. The key idea is to derive a smooth, differentiable approximation to the otherwise non-smooth, non-convex search space. With a smooth, differentiable approximation, we can leverage efficient gradient-based search algorithms to find high-quality mappings. We extensively compare Mind Mappings to black-box optimization schemes used in prior work. When tasked to find mappings for two important workloads (CNN and MTTKRP), the proposed search finds mappings that achieve an average $1.40\times$, $1.76\times$, and $1.29\times$ (when run for a fixed number of steps) and $3.16\times$, $4.19\times$, and $2.90\times$ (when run for a fixed amount of time) better energy-delay product (EDP) relative to Simulated Annealing, Genetic Algorithms and Reinforcement Learning, respectively. Meanwhile, Mind Mappings returns mappings with only $5.32\times$ higher EDP than a possibly unachievable theoretical lower-bound, indicating proximity to the global optima.

翻译：现代日计算日益依赖专业化来满足不断增长的绩效和效率要求。设计这类专门硬件结构的核心挑战是如何进行空间绘图搜索,即从算法到硬件的搜索。先前的工作表明,选择效率低的映射可以导致多重性因素效率管理。此外,搜索空间不仅大,而且非混凝土和非混凝土,从而排除了先进的搜索技术。因此,以往的工作被迫使用专家选择或亚最佳搜索超模来进行空间搜索。这项工作提出了“思维映射”,这是用于算法-加速器空间搜索的新型梯度搜索方法。关键的想法是,选择效率低的映射可以导致多重因素-效率管理。此外,我们还可以利用高效的基于梯度的搜索算法来找到高质量的映射。我们把“思维映射”比先前工作中使用的黑格优化计划(N90美元和MTERCFM)要广泛比较。当为两个重要的工作量(CN-90美元和亚梯-39美元的相对值)进行快速的绘图时,为平均40美元的固定时间值(美元)进行搜索,一个固定值的固定值和最低值(美元),一个固定时间值的搜索,一个固定值为40美元的值的值的值的算值的算值为40值的值的值的算算,然后表示。