In this paper, we summarize our effort to create and utilize a simple framework to coordinate computational analytics tasks with the help of a workflow system. Our design is based on a minimalistic approach while at the same time allowing to access computational resources offered through the owner's computer, HPC computing centers, cloud resources, and distributed systems in general. The access to this framework includes a simple GUI for monitoring and managing the workflow, a REST service, a command line interface, as well as a Python interface. The resulting framework was developed for several examples targeting benchmarks of AI applications on hybrid compute resources and as an educational tool for teaching scientists and students sophisticated concepts to execute computations on resources ranging from a single computer to many thousands of computers as part of on-premise and cloud infrastructure. We demonstrate the usefulness of the tool on a number of examples. The code is available as an open-source project in GitHub and is based on an easy-to-enhance tool called cloudmesh.
翻译:在本文中,我们总结了我们为在工作流程系统的帮助下建立和利用一个简单的框架来协调计算分析任务而做出的努力。我们的设计基于一种最低限度的方法,同时允许获得通过所有者计算机、高PC计算中心、云源资源以及一般分布系统提供的计算资源。进入这一框架包括用于监测和管理工作流程的简单图形、REST服务、指挥线接口以及Python界面。由此而来的框架是为一些例子制定的,这些例子针对的是AI在混合计算资源方面的应用基准,并作为一种教育工具,用于教授科学家和学生精密概念,用于计算从一台计算机到数千台计算机的资源,作为预设和云层基础设施的一部分。我们在若干例子中展示了该工具的有用性。该代码在GitHub作为开放源项目提供,并基于一个称为云群化的简单工具。