C3O: Collaborative Cluster Configuration Optimization for Distributed Data Processing in Public Clouds



July 28, 2021



Jonathan Will, Lauritz Thamsen, Dominik Scheinert, Jonathan Bader, Odej Kao

From arXiv: 10 pages, 5 figures, IEEE IC2E 2021. arXiv admin note: text overlap with arXiv:2011.07965

Distributed dataflow systems enable data-parallel processing of large datasets on clusters. Public cloud providers offer a large variety and quantity of resources that can be used for such clusters. Yet, selecting appropriate cloud resources for dataflow jobs - that neither lead to bottlenecks nor to low resource utilization - is often challenging, even for expert users such as data engineers. We present C3O, a collaborative system for optimizing data processing cluster configurations in public clouds based on shared historical runtime data. The shared data is utilized for predicting the runtimes of data processing jobs on different possible cluster configurations, using specialized regression models. These models take the diverse execution contexts of different users into account and exhibit mean absolute errors below 3% in our experimental evaluation with 930 unique Spark jobs.
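The idea of predicting runtimes from shared historical data and then picking a cluster configuration can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the runtime records are synthetic, the model `runtime = a + b * (size / nodes)` is a deliberately simple stand-in rather than the specialized regression models the abstract refers to, and the function names and the per-node price are hypothetical.

```python
from statistics import mean

# Hypothetical shared runtime records: (nodes, dataset_size_gb, runtime_s).
# Synthetic values for illustration only; they are not taken from the paper.
history = [
    (2, 10, 540.0),
    (4, 10, 275.0),
    (8, 10, 160.0),
    (4, 20, 525.0),
    (8, 20, 285.0),
]


def fit_runtime_model(records):
    """Least-squares fit of runtime = a + b * (size / nodes).

    This simple parametric form is an assumption of this sketch.
    """
    xs = [size / nodes for nodes, size, _ in records]
    ys = [runtime for _, _, runtime in records]
    x_bar, y_bar = mean(xs), mean(ys)
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    s_xx = sum((x - x_bar) ** 2 for x in xs)
    b = s_xy / s_xx
    a = y_bar - b * x_bar
    return a, b


def predict(model, nodes, size_gb):
    """Predicted runtime in seconds for a given scale-out and dataset size."""
    a, b = model
    return a + b * (size_gb / nodes)


def mean_absolute_percentage_error(model, records):
    """Mean of |predicted - actual| / actual over the given records."""
    return mean(abs(predict(model, n, s) - t) / t for n, s, t in records)


def choose_scale_out(model, size_gb, max_runtime_s, candidates, price_per_node_hour):
    """Return the cheapest candidate node count meeting the runtime target, or None."""
    best = None
    for nodes in candidates:
        runtime = predict(model, nodes, size_gb)
        if runtime > max_runtime_s:
            continue  # this configuration would miss the runtime target
        cost = nodes * price_per_node_hour * runtime / 3600.0
        if best is None or cost < best[1]:
            best = (nodes, cost)
    return None if best is None else best[0]


model = fit_runtime_model(history)
error = mean_absolute_percentage_error(model, history)
best_nodes = choose_scale_out(
    model,
    size_gb=40,
    max_runtime_s=900,
    candidates=[2, 4, 8, 16],
    price_per_node_hour=0.2,  # hypothetical price
)
print(f"training error: {error:.1%}, chosen scale-out: {best_nodes}")
```

Note that the error printed here is measured on the same records used for fitting, so it says nothing about accuracy on unseen jobs; a real evaluation would hold out data, and would also need to account for differing execution contexts across users, which this sketch ignores.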


