This paper presents HALO 1.0, an open-ended extensible multi-agent software framework that implements a set of proposed hardware-agnostic accelerator orchestration (HALO) principles. HALO implements a novel compute-centric message passing interface (C^2MPI) specification for enabling the performance-portable execution of a hardware-agnostic host application across heterogeneous accelerators. The experiment results of evaluating eight widely used HPC subroutines based on Intel Xeon E5-2620 CPUs, Intel Arria 10 GX FPGAs, and NVIDIA GeForce RTX 2080 Ti GPUs show that HALO 1.0 allows for a unified control flow for host programs to run across all the computing devices with a consistently top performance portability score, which is up to five orders of magnitude higher than the OpenCL-based solution.
翻译:本文件介绍了HALO 1.0,这是一个可自由扩展的多试剂多试剂软件框架,可以实施一套拟议的硬件加速器管弦化(HALO)原则。HALO采用新的计算中心电文传递界面(C ⁇ 2MPI)规格,以便能够在各种加速器之间以可操作的便携式方式执行硬件控制主机应用程序。根据Intel Xeon E5-2620 CPUs、Intel Ariay 10 GX FPGAs和NVIDIA Geforace RTX 2080 Ti GPUs对八种广泛使用的HPC子程序进行评估的实验结果。HLO 1.0允许主机程序使用统一的控制流程,以一致的顶级可移植性分数运行所有计算机设备,最高可移动性分可达五级,高于基于可开氯氟化物的解决方案。