This paper assesses and reports the experience of eleven application teams working to build, validate, and benchmark several HPC applications on a novel GPU-accelerated Arm testbed. The testbed consists of the latest, at the time of writing, Arm Devkits from NVIDIA, with server-class Arm CPUs and NVIDIA A100 GPUs. The applications and mini-apps are written using multiple languages and parallel programming models, including C++, C, CUDA, Fortran, OpenACC, and OpenMP. Each application builds extensively on the rest of the programming environment, including scientific libraries, compilers, and other tooling. Our goal is to evaluate application readiness for the next generation of Arm and GPU-based HPC systems and to determine tooling readiness for future application developers. On both counts, the reported case studies demonstrate that the diverse software and tools available for GPU-accelerated Arm systems are ready for production, even before NVIDIA deploys its next-generation platform of this kind, Grace.