The emergence of machine learning, image and audio processing on edge devices has motivated research towards power efficient custom hardware accelerators. Though FPGAs are an ideal target for energy efficient custom accelerators, the difficulty of hardware design and the lack of vendor agnostic, standardized hardware compilation infrastructure has hindered their adoption. This paper introduces HIR, an MLIR-based intermediate representation (IR) to describe hardware accelerator designs. HIR combines high level language features, such as loops and multi-dimensional tensors, with programmer defined explicit scheduling, to provide a high-level IR suitable for DSL compiler pipelines without compromising control over the micro-architecture of the accelerator. HIR's explicit schedules allow it to express fine-grained, synchronization-free parallelism and optimizations such as retiming and pipelining. Built as a dialect in MLIR, it draws from best IR practices learnt from communities like those of LLVM. While offering rich optimization opportunities and a high level abstraction, HIR enables sharing of optimizations, utilities and passes with software compiler infrastructure. Our implementation shows that the code generation time of the HIR code generator is on average 1112x lower than that of Xilinx Vivado HLS on a range of kernels without a compromise on the quality of the generated hardware. We believe that these are significant steps forward in the design of IRs for hardware synthesis and in equipping domain-specific languages with a productive and performing compilation path to custom hardware acceleration.
翻译:虽然FPGA是节能定制加速器的理想目标,但硬件设计困难和缺乏供应商不可知的标准化硬件汇编基础设施阻碍了其采用。本文介绍了基于MLIR的中间代号HIR, 一个基于MLIR的中间代号(IR)来描述硬件加速器的设计。HIR结合了高层次语言特征,如循环和多维扩音器,并确定了程序员的明确时间安排,为DSL编译管道提供了高水平的IR,同时为DS编译管道提供了高水平的IR,而不损害对加速器微结构的节能加速器的节能加速器控制。HIR的明确时间表使得它能够表达精细的、不同步的平行和优化,例如重塑和管道。在MLLIR中,作为方言,它借鉴了像LLVM这样的社区所学到的最佳IR做法,同时提供了丰富的优化机会和高层次的抽象信息,HIR能够分享对加速加速加速加速加速器的微结构的节能路的控制。HIR明确时间步骤,而我们在SIS的硬件标准中,我们在SLLVLMM等社区所生成的硬件的硬件标准中,我们在SLVDIS的硬件标准执行中,在SLIS的高级硬件标准中,在SDR节流流流流流中,在SDR标准中,在SDR标准中,这是我们的平均标准中,这是一个普通的普通的普通的普通的普通的普通的普通的普通的高级的硬件的硬件的编码的编码的编码的编码的编码中,这是一个比。