Empirical Dynamic Modeling (EDM) is a state-of-the-art non-linear time-series analysis framework. Despite its wide applicability, EDM was not scalable to large datasets due to its expensive computational cost. To overcome this obstacle, researchers have attempted and succeeded in accelerating EDM from both algorithmic and implementational aspects. In previous work, we developed a massively parallel implementation of EDM targeting HPC systems (mpEDM). However, mpEDM maintains different backends for different architectures. This design becomes a burden in the increasingly diversifying HPC systems, when porting to new hardware. In this paper, we design and develop a performance-portable implementation of EDM based on the Kokkos performance portability framework (kEDM), which runs on both CPUs and GPUs while based on a single codebase. Furthermore, we optimize individual kernels specifically for EDM computation, and use real-world datasets to demonstrate up to $5.5\times$ speedup compared to mpEDM in convergent cross mapping computation.
翻译:经验动态模型(EDM)是一个最先进的非线性时间序列分析框架。尽管它具有广泛适用性,但EDM由于其昂贵的计算成本,无法向大型数据集扩展。为了克服这一障碍,研究人员尝试并成功地从算法和执行方面加速了EDM。在以往的工作中,我们开发了大规模平行实施EDM系统,专门针对HPC系统(mpEDM)。然而,MPEDM为不同的结构保留了不同的后端。这种设计在将HPC系统移植到新硬件时,成为日益多样化的HPC系统的一个负担。在本文中,我们根据Kokkos性能可移植性框架(kEMM)设计和开发了EDM的可操作性操作性实施,该框架在单一代码基的基础上运行于CPU和GPU。此外,我们优化了专门用于EDM计算的个人内核,并使用真实世界数据集,以显示在综合交叉绘图中与 MP 相比,速度高达5.5美元。