Considering the problem of resource allocation for potentially complex and diverse streaming (e.g., query processing) or long-running iterative (e.g., deep learning) workloads in the public cloud, we argue that a framework based on simulated annealing is suitable for navigating performance/cost trade-offs when selecting from among heterogeneous service offerings. Annealing is particularly useful when the complex workload and heterogeneous service offerings may vary over time. Based on a macroscopic objective that combines both performance and cost terms, annealing facilitates light-weight and coherent policies of exploration and exploitation when considering the service suite offered by a cloud provider. In this paper, we first give some background on simulated annealing and then demonstrate through experiments the usefulness of a particular resource management framework based on it: selecting the types and numbers of virtual machines for a particular job stream.
翻译:考虑到公共云层中潜在复杂和多样化流流(例如查询处理)或长期迭代(例如深学习)工作量的资源分配问题,我们认为,基于模拟排泄的框架在从多种服务提供中选择业绩/成本权衡时,是合适的。当复杂工作量和多种服务提供可能随时间而变化时,安纳林特别有用。根据将业绩和成本条件结合起来的宏观目标,在考虑云源提供者提供的服务套件时,安纳林有助于采取轻量和连贯的勘探和开发政策。在本文件中,我们首先介绍一些模拟排泄的背景,然后通过实验展示基于这一框架的特定资源管理框架的有用性:为某一特定工作流选择虚拟机器的类型和数量。