The ever-increasing gap between compute and I/O performance in HPC platforms, together with the development of novel NVMe storage devices (NVRAM), led to the emergence of the burst buffer concept - an intermediate persistent storage layer logically positioned between random-access main memory and a parallel file system. Despite the development of real-world architectures as well as research concepts, resource and job management systems, such as Slurm, provide only marginal support for scheduling jobs with burst buffer requirements, in particular ignoring burst buffers when backfilling. We investigate the impact of burst buffer reservations on the overall efficiency of online job scheduling for common algorithms: First-Come-First-Served (FCFS) and Shortest-Job-First (SJF) EASY-backfilling. We evaluate the algorithms in a detailed simulation with I/O side effects. Our results indicate that the lack of burst buffer reservations in backfilling may significantly deteriorate scheduling. We also show that these algorithms can be easily extended to support burst buffers. Finally, we propose a burst-buffer-aware plan-based scheduling algorithm with simulated annealing optimisation, which improves the mean waiting time by over 20% and mean bounded slowdown by 27% compared to the burst-buffer-aware SJF-EASY-backfilling.
翻译:HPC平台的计算和I/O性能之间日益扩大的差距,加上开发了新型的NVMe存储装置(NRRAM),导致了爆发缓冲概念的出现 -- -- 中间的持久性储存层,在随机访问主内存和平行文件系统之间逻辑定位。尽管开发了真实世界架构以及研究概念、资源和工作管理系统,如Slurm,但对于有爆炸缓冲要求的工作时间安排,只能提供边际支持,特别是当回填时忽略爆裂缓冲。我们调查了爆发缓冲保留对通用算法(FCFFS)和短期-Job-IF(SJFF)在线工作时间安排总体效率的影响。我们用IMBRE-FS-S-FAFA-FA的缓冲调整来详细评估算法及其I/O侧效应。我们的结果表明,回填时的缓冲缓冲保留可能大大地恶化了时间安排。我们还表明,这些算法可以很容易扩展以支持爆缓冲。最后,我们建议用缓冲式缓冲计划对通用算法的在线工作安排总效率的影响:先到S-BRAFA-B-FA-FA-B-C-C-C-C-C-C-S-FRA-C-C-C-C-C-S-S-C-S-S-B-S-S-S-S-S-B-S-S-F-F-F-F-F-F-F-F-F-F-F-F-F-FRAF-S-S-S-C-C-C-C-C-S-C-S-S-S-S-S-S-S-S-F-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-