Energy efficiency has emerged as a central challenge for modern high-performance computing (HPC) systems, where escalating computational demands and architectural complexity have led to significant energy footprints. This paper presents the collective experience of the EuroHPC JU Center of Excellence in Exascale CFD (CEEC) in measuring, analyzing, and optimizing energy consumption across major European HPC systems. We briefly review key methodologies and tools for energy measurement as well as define metrics for reporting results. Through case studies using representative CFD applications (waLBerla, FLEXI/GAL{\AE}XI, Neko, and NekRS), we evaluate energy-to-solution and time-to-solution metrics on diverse architectures, including CPU- and GPU-based partitions of LUMI, MareNostrum5, MeluXina, and JUWELS Booster. Our results highlight the advantages of accelerators and mixed-precision techniques for reducing energy consumption while maintaining computational accuracy. Finally, we advocate the need to facilitate energy measurements on HPC systems in order to raise awareness, teach the community, and take actions toward more sustainable exascale computing.
翻译:能源效率已成为现代高性能计算(HPC)系统的核心挑战,计算需求的不断增长与体系结构复杂性的提升导致了显著的能源足迹。本文介绍了欧洲百亿亿次计算流体力学卓越中心(CEEC)在测量、分析和优化欧洲主要HPC系统能耗方面的集体经验。我们简要回顾了能源测量的关键方法与工具,并定义了结果报告的度量标准。通过使用代表性CFD应用(waLBerla、FLEXI/GALÆXI、Neko和NekRS)的案例研究,我们在多种体系结构(包括LUMI、MareNostrum5、MeluXina和JUWELS Booster的CPU与GPU分区)上评估了能耗-求解与时间-求解指标。研究结果突显了加速器与混合精度技术在降低能耗同时保持计算精度方面的优势。最后,我们倡导在HPC系统中推广能耗测量,以提高意识、教育社区,并推动更可持续的百亿亿次计算发展。