This paper presents HALO 1.0, an open-ended extensible multi-agent software framework that implements a set of proposed hardware-agnostic accelerator orchestration (HALO) principles. HALO implements a novel compute-centric message passing interface (C^2MPI) specification for enabling the performance portable execution of a hardware-agnostic host application across heterogeneous accelerators. The experiment results of evaluating eight widely used HPC subroutines based on Intel Xeon E5-2620 CPUs, Intel Arria 10 GX FPGAs, and NVIDIA GeForce RTX 2080 Ti GPUs show that HALO 1.0 allows for a unified control flow for host programs to run across all the computing devices with a consistently top performance portability score, which is up to five orders of magnitude higher than the OpenCL-based solution.
翻译:本文件介绍了HALO 1.0,这是一个可自由扩展的多试剂多试剂软件框架,可以实施一套拟议的硬件-加速器管弦化(HALO)原则。HALO采用新的计算中心电文传递界面(C ⁇ 2MPI)规格,以便能够在各种加速器之间执行硬件-加速器的可移植主机应用程序。根据Intel Xeon E5-2620 CPUs、Intel Ariay 10 GX FPGAs和NVIDIA GeForace RTX 2080 Ti GPUs,对八种广泛使用的HPC子程序进行评估的实验结果显示,HALO 1.0允许主机程序使用统一的控制流,以一致的顶级可移植性分数运行所有计算机设备,该可移动性可达比基于开放控制路的解决方案高五级。