Scientific workflows are used almost universally across scientific domains and have underpinned some of the most significant discoveries of the past several decades. Many of these workflows have high computational, storage, and/or communication demands, and thus must execute on a wide range of large-scale platforms, from large clouds to upcoming exascale high-performance computing (HPC) platforms. These executions must be managed by some software infrastructure. Due to the popularity of workflows, workflow management systems (WMSs) have been developed to provide abstractions for creating and executing workflows conveniently, efficiently, and portably. While these efforts are all worthwhile, there are now hundreds of independent WMSs, many of which are moribund. As a result, the WMS landscape is segmented and presents significant barriers to entry, because these hundreds of systems are seemingly comparable yet incompatible. Consequently, many teams, small and large, still elect to build their own custom workflow solution rather than adopt, or build upon, existing WMSs. The current state of the WMS landscape negatively impacts workflow users, developers, and researchers. The "Workflows Community Summit" was held online on January 13, 2021. The overarching goal of the summit was to develop a view of the state of the art and identify crucial research challenges in the workflow community. Prior to the summit, a survey sent to stakeholders in the workflow community (including both developers of WMSs and users of workflows) helped to identify key challenges in this community. These challenges were translated into six broad themes for the summit, each of which was the subject of a focused discussion led by a volunteer member of the community. This report documents and organizes the wealth of information provided by the participants before, during, and after the summit.