We assess costs and efficiency of state-of-the-art high performance cloud computing compared to a traditional on-premises compute cluster. Our use case are atomistic simulations carried out with the GROMACS molecular dynamics (MD) toolkit with a focus on alchemical protein-ligand binding free energy calculations. We set up a compute cluster in the Amazon Web Services (AWS) cloud that incorporates various different instances with Intel, AMD, and ARM CPUs, some with GPU acceleration. Using representative biomolecular simulation systems we benchmark how GROMACS performs on individual instances and across multiple instances. Thereby we assess which instances deliver the highest performance and which are the most cost-efficient ones for our use case. We find that, in terms of total costs including hardware, personnel, room, energy and cooling, producing MD trajectories in the cloud can be as cost-efficient as an on-premises cluster given that optimal cloud instances are chosen. Further, we find that high-throughput ligand-screening for protein-ligand binding affinity estimation can be accelerated dramatically by using global cloud resources. For a ligand screening study consisting of 19,872 independent simulations, we used all hardware that was available in the cloud at the time of the study. The computations scaled-up to reach peak performances using more than 10,000 instances, 140,000 cores, and 3,000 GPUs simultaneously around the globe. Our simulation ensemble finished in about two days in the cloud, while weeks would be required to complete the task on a typical on-premises cluster consisting of several hundred nodes. We demonstrate that the costs of such and similar studies can be drastically reduced with a checkpoint-restart protocol that allows to use cheap Spot pricing and by using instance types with optimal cost-efficiency.
翻译:我们评估了最先进的高性能云计算的成本和效率,而与传统的低温模型计算组相比,我们评估了高性能云计算的成本和效率。我们的使用案例是用GROMAACS分子动态工具包进行原子模拟,重点是白化蛋白质和约束性自由能源计算。我们在亚马逊网络服务(AWS)云中设置了一个计算组,其中包括英特尔、AMD和ARM CPU等不同的例子,其中一些是廉价加速的。使用具有代表性的生物分子模拟系统,我们衡量GROMACS在个别事例和多个事例中的表现。我们通过评估哪些事例能提供最高性能,哪些是我们使用该软件的最具成本效益的模型。我们发现,从总成本上看,包括硬件、人员、房间、能源和冷却,在云层中生成MDRT轨道,可以像在最佳的云层场景中那样,通过最佳的云层场景,我们发现高性能驱动力和筛选关于蛋白度和紧凑度估算的功能。通过我们评估的结果可以大大加速进行。在两个例子中,通过使用全球云层模型模拟模拟模拟,在187年的模型中可以快速模拟中进行。我们用来进行最高级的计算,在18值的模型中进行最高级的计算,我们用来进行最高级的计算。