As part of the Exascale Computing Project (ECP), a recent focus of development efforts for the SUite of Nonlinear and DIfferential/ALgebraic equation Solvers (SUNDIALS) has been to enable GPU-accelerated time integration in scientific applications at extreme scales. This effort has resulted in several new GPU-enabled implementations of core SUNDIALS data structures, support for programming paradigms which are aware of the heterogeneous architectures, and the introduction of utilities to provide new points of flexibility. In this paper, we discuss our considerations, both internal and external, when designing these new features and present the features themselves. We also present performance results for several of the features on the Summit supercomputer and early access hardware for the Frontier supercomputer, which demonstrate negligible performance overhead resulting from the additional infrastructure and significant speedups when using both NVIDIA and AMD GPUs.
翻译:作为 " 速成计算项目 " 的一部分,最近发展非线性和代费性/代数等式解答器(SUNDIALS)的工作重点之一是在极端规模的科学应用方面加快GPU的加速时间整合,使GPU在极端规模的科学应用中能够加速时间整合,这导致实施了几个新的以GPU带动的SUDIALS核心数据结构,支持了解多种结构的方案拟订模式,并引进了提供新灵活性点的公用事业。在本文件中,我们在设计这些新特征并介绍这些特征时,讨论了我们的内部和外部考虑。我们还介绍了峰会上的一些超级计算机和边境超级计算机早期进入硬件的绩效结果,这些功能表明由于使用NVIDIA和AMD GPU而增加的基础设施以及大量超速率,导致的性能间接费用微不足道。