The High Energy Physics (HEP) experiments, such as those at the Large Hadron Collider (LHC), traditionally consume large amounts of CPU cycles for detector simulations and data analysis, but rarely use compute accelerators such as GPUs. As the LHC is upgraded to allow for higher luminosity, resulting in much higher data rates, purely relying on CPUs may not provide enough computing power to support the simulation and data analysis needs. As a proof of concept, we investigate the feasibility of porting a HEP parameterized calorimeter simulation code to GPUs. We have chosen to use FastCaloSim, the ATLAS fast parametrized calorimeter simulation. While FastCaloSim is sufficiently fast such that it does not impose a bottleneck in detector simulations overall, significant speed-ups in the processing of large samples can be achieved from GPU parallelization at both the particle (intra-event) and event levels; this is especially beneficial in conditions expected at the high-luminosity LHC, where extremely high per-event particle multiplicities will result from the many simultaneous proton-proton collisions. We report our experience with porting FastCaloSim to NVIDIA GPUs using CUDA. A preliminary Kokkos implementation of FastCaloSim for portability to other parallel architectures is also described.
翻译:高能物理(HEP)实验,如大型高原相撞机(LHC)的实验,传统上消耗大量CPU周期进行检测模拟和数据分析,但很少使用计算加速器,如GPUs。LHC升级后允许更高的光度,导致数据率高得多,完全依赖CPU可能无法提供足够的计算能力来支持模拟和数据分析需要。作为概念的证明,我们调查将HEP参数化热量计模拟代码移植到GPUs的可行性。我们选择了使用快速CaloSim(ATLAS)快速超模化热量计模拟。虽然快速CaloSim(TalCal)的模拟速度足够快,因此它不会在检测模拟中造成更多的瓶颈,但是在大型样品的处理中,在粒子(事件)和事件级别上都能够实现显著的加速速度。作为概念的证明,我们在高光度LHCHC(高反常数粒度)模拟代码中,我们选择使用ATLLAS(ATLAS)快速准焦度快速焦度模拟模拟。使用GC(C)的快速同步)AA(S)的同步)系统(SUDIS)系统(S)的同步)实施模型(S)初步系统(SUDIS-S)系统(S)系统(S)的同步)系统(PIFUDUDUDUDS)结构,将产生大量(S)其他同步)的同步)的初步报告。