Many scientific workflows can be represented by a Directed Acyclic Graph (DAG) where each node represents a task, and there will be a directed edge between two tasks if and only if there is a dependency relationship between the two i.e. the second one can not be started unless the first one is finished. Due to the increasing computational requirements of these workflows, they are deployed on cloud computing systems. Scheduling of workflows on such systems to achieve certain goals(e.g. minimization of makespan, cost, or maximization of reliability, etc.) remains an active area of research. In this paper, we propose a scheduling algorithm for allocating the nodes of our task graph in a heterogeneous multi-cloud system. The proposed scheduler considers many practical concerns such as pricing mechanisms, discounting schemes, and reliability analysis for task execution. This is a list-based heuristic that allocates tasks based on the expected times for which VMs need to be rented for them. We have analyzed the proposed approach to understand its time requirement. We perform a large number of experiments with real-world workflows: FFT, Ligo, Epigenomics, and Random workflows and observe that the proposed scheduler outperforms the state-of-art approaches up to 12%, 11%, and 1.1% in terms of cost, makespan, and reliability, respectively.
翻译:许多科学工作流程可以用一个直接环形图(DAG)代表许多科学工作流程,其中每个节点代表着一项任务,如果而且只有在两种任务之间有依赖关系的情况下,两个任务之间才有直接的边际,即,第二个任务不能启动,除非第一个任务完成。由于这些工作流程的计算要求不断增加,这些工作流程被部署在云计算系统中。将工作流程安排在这种系统上,以实现某些目标(例如,最小化月球、成本或最大程度的可靠性等),这仍然是一个活跃的研究领域。在本文件中,我们提议了一个将我们的任务图节点分配到一个多样化的多层系统中的时间安排算法。提议的调度员考虑许多实际问题,如定价机制、贴现计划和任务执行的可靠性分析。这是一个基于列表的超常,根据VMS需要租用的预期时间分配任务。我们分析了为了解其时间要求而提议的方法。我们用大量实际世界工作流程进行实验:FFT、Ligo、Epigenartimical 和11-resmall-ressional-ressional-restial-prolations laps lats, 12-pal-restimal-pal-pal-pal-pal-cs restics) 。