The use of approximation is fundamental in computational science. Almost all computational methods adopt approximations in some form in order to obtain a favourable cost/accuracy trade-off and there are usually many approximations that could be used. As a result, when a researcher wishes to measure a property of a system with a computational technique, they are faced with an array of options. Current computational workflow frameworks focus on helping researchers automate a sequence of steps on a particular platform. The aim is often to obtain a computational measurement of a property. However these frameworks are unaware that there may be a large number of ways to do so. As such, they cannot support researchers in making these choices during development or at execution-time. We argue that computational workflow frameworks should be designed to be \textit{approximation-aware} - that is, support the fact that a given workflow description represents a task that \textit{could} be performed in different ways. This is key to unlocking the potential of computational workflows to accelerate discovery tasks, particularly those involving searches of large entity spaces. It will enable efficiently obtaining measurements of entity properties, given a set of constraints, by directly leveraging the space of choices available. In this paper we describe the basic functions that an approximation-aware workflow framework should provide, how those functions can be realized in practice, and illustrate some of the powerful capabilities it would enable, including approximate memoization, surrogate model support, and automated workflow composition.
翻译:近似的使用在计算科学中至关重要。几乎所有计算方法都采用某种形式的近近,以获得有利的成本/准确的权衡,通常可以使用许多近似。因此,当研究人员希望用计算技术测量系统属性时,他们面临一系列选项。当前的计算工作流程框架侧重于帮助研究人员在特定平台上实现一系列步骤的自动化。目的是经常获得对财产的计算计量。但这些框架并不意识到可能存在大量的方法。因此,他们无法支持研究人员在发展或执行时作出这些选择。我们主张计算工作流程框架的设计应当是\textit{approcolomation-aware}-也就是说,支持以下事实,即特定工作流程描述是一个任务,可以以不同方式执行。这是释放计算工作流程潜力以加速发现任务的关键,特别是涉及搜索大实体空间的任务。因此,他们无法在开发或执行时支持研究人员作出这些选择。我们主张计算工作流程框架的设计应当是:计算系统属性的属性属性属性的属性属性属性属性的属性属性属性属性,根据我们现有基本文件的准确性能说明我们如何利用这些实际的准确性框架。