We present results of a performance study of V2D, an astrophysical radiation hydrodynamics code, on the Arm-based A64FX processor developed by Fujitsu. The code solves sparse linear systems, a task for which the A64FX architecture should be well suited. We carried out the study on Ookami, an Apollo 80 platform built on the A64FX processor. We explored several compilers and performance analysis packages and found that the code did not perform as expected when compiled with Scalable Vector Extension (SVE) optimization, which suggests that a deeper analysis of the code is worthwhile. However, a simple driver program that exercised the basic sparse linear algebra routines used by V2D did show significant speedup with SVE optimization. We present the initial results of the study, which ran V2D on a relatively simple test problem that emphasized the repeated solution of sparse linear systems.