Workflow applications are becoming increasingly important to support scientific discovery. This is leading to a proliferation of workflow management systems and, thus, to a fragmented software ecosystem. Integration among existing workflow tools can improve development efficiency and, ultimately, increase the sustainability of scientific workflow software. We describe our experience with integrating RADICAL-Pilot (RP) and Parsl as a way to enable users to develop and execute workflow applications with heterogeneous tasks on heterogeneous high-performance computing resources. We describe our approach to the integration of the two systems and detail the development of RPEX, a Parsl executor that uses RP as its workload manager. We develop an RP executor that executes heterogeneous MPI Python functions on CPU cores and GPUs. We measure the weak and strong scaling of RPEX, RP, and Parsl when providing new capabilities to two paradigmatic use cases: Colmena and Ice Wedge Polygons.