Molecular dynamics (MD) simulations are widely used to study large-scale molecular systems. HPC systems are ideal platforms to run these studies, however, reaching the necessary simulation timescale to detect rare processes is challenging, even with modern supercomputers. To overcome the timescale limitation, the simulation of a long MD trajectory is replaced by multiple short-range simulations that are executed simultaneously in an ensemble of simulations. Analyses are usually co-scheduled with these simulations to efficiently process large volumes of data generated by the simulations at runtime, thanks to in situ techniques. Executing a workflow ensemble of simulations and their in situ analyses requires efficient co-scheduling strategies and sophisticated management of computational resources so that they are not slowing down each other. In this paper, we propose an efficient method to co-schedule simulations and in situ analyses such that the makespan of the workflow ensemble is minimized. We present a novel approach to allocate resources for a workflow ensemble under resource constraints by using a theoretical framework modeling the workflow ensemble's execution. We evaluate the proposed approach using an accurate simulator based on the WRENCH simulation framework on various workflow ensemble configurations. Results demonstrate the significance of co-scheduling simulations and in situ analyses that couple data together to benefit from data locality, in which inefficient scheduling decisions can lead up to a factor 30 slowdown in makespan.
翻译:分子动态模拟(MD)被广泛用于研究大型分子系统。HPC系统是进行这些研究的理想平台,但是,即使是现代超级计算机,达到必要的模拟时间尺度以探测稀有过程也具有挑战性。为了克服时间尺度的限制,长MD轨的模拟被在模拟组合中同时执行的多个短程模拟所取代。分析通常与这些模拟同时安排,以便高效处理运行时模拟产生的大量数据,这要归功于现场技术。执行一个模拟及其现场分析的工作流程组合,需要有效的联合安排战略和精密的计算资源管理,这样它们就不会相互减速。在本文中,我们提出了一个有效的方法来共同安排模拟模拟和现场分析,这样就能将工作流程的构成最小化。我们提出了一个新颖的方法,通过使用一个理论框架来模拟工作流程组合执行过程及其现场分析,我们用一个模拟模型来评估拟议的方法,在模拟模型模型中,在模拟模型模型中,可以共同展示关于精确的实地结果的进度分析,在模拟中,在模拟中,在模拟模型中,可以展示关于精确的实地结果的进度分析中,在模拟中,在模拟中,在模拟中,在模拟中,可以对各种结果进行。