Designing heterogeneous systems that include domain-specific accelerators is challenging and time-consuming. Under area constraints, designers must decide which parts of an application to accelerate in hardware and which to leave in software. Moreover, applications in domains such as Extended Reality (XR) offer opportunities for several forms of parallel execution, including loop-level, task-level, and pipeline parallelism. To assist the design process and expose every available level of parallelism, we present Trireme, a fully automated toolchain that explores multiple levels of parallelism and produces domain-specific accelerator designs and configurations that maximize performance for a given area budget. Experiments on demanding benchmarks from the XR domain show speedups of up to 20x, and of up to 37x for smaller applications, compared to software-only implementations.