Vector architectures are gaining traction for highly efficient processing of data-parallel workloads, driven by all major ISAs (RISC-V, Arm, Intel), and boosted by landmark chips, like the Arm SVE-based Fujitsu A64FX, powering the TOP500 leader Fugaku. The RISC-V V extension has recently reached 1.0-Frozen status. Here, we present its first open-source implementation, discuss the new specification's impact on the micro-architecture of a lane-based design, and provide insights on performance-oriented design of coupled scalar-vector processors. Our system achieves comparable/better PPA than state-of-the-art vector engines that implement older RVV versions: 15% better area, 6% improved throughput, and FPU utilization >98.5% on crucial kernels.
翻译:矢量结构正在逐渐得到驱动,以便高效处理由所有主要国际审计机构(IRSC-V、Arm、Intel)驱动的数据平行工作量,并受到里程碑芯片的推动,如Arm SVE-Fujitsu A64FX,赋予TOP500领导Fugaku的权力。RISC-V V的扩展最近达到了1.0-Frozen的地位。在这里,我们介绍了其第一个开放源码实施情况,讨论了新规格对基于通道的设计的微结构的影响,并介绍了以性能为导向的混合天动电动电动处理器设计。我们的系统比使用老式RVV版本的最先进的矢量引擎实现了可比/更好的 PPA:15%更好的区域,6%的吞吐量得到改善,关键内核的FPU利用率为98.5 % 。