PGAS 运行时间的可缩放的以动作器为基础的编程系统 (A Scalable Actor-based Programming System for PGAS Runtimes) - 专知论文

会员服务 ·

0

Performer · CASES · Scala · Rust · 操作 ·

2021 年 7 月 12 日

A Scalable Actor-based Programming System for PGAS Runtimes

翻译：PGAS 运行时间的可缩放的以动作器为基础的编程系统

Sri Raj Paul,Akihiro Hayashi,Kun Chen,Vivek Sarkar

PGAS runtimes are well suited to irregular applications due to their support for short, one-sided messages. However, there are two major sources of overhead in PGAS runtimes that prevent them from achieving acceptable performance on large scale parallel systems. First, despite the availability of APIs that support non-blocking operations for important special cases, many PGAS operations on remote locations are synchronous by default, which can lead to long-latency stalls. Second, efficient inter-node communication requires careful aggregation and management of short messages. Experiments has shown that the performance of PGAS programs can be improved by more than 20$\times$ through the use of specialized lower-level libraries, such as Conveyors, but with a significant impact on programming productivity. The actor model has been gaining popularity in many modern programming languages such as Scala or Rust and also within the cloud computing community. In this paper, we introduce a new programming system for PGAS runtimes, in which all remote operations are asynchronous by default through the use of an actor-based programming system. In this approach, the programmer does not need to worry about complexities related to message aggregation and termination detection. Thus, our approach offers a desirable point in the productivity-performance spectrum, with scalable performance that approaches that of lower-level aggregation libraries but with higher productivity.

翻译：PGAS运行时间非常适合非常规应用,因为它们支持短片片面信息。然而,PGAS运行时间有两大管理费用来源,使得无法在大型平行系统中取得可接受的业绩。首先,尽管有支持重要特殊情况下非阻塞操作的APIS,但许多偏远地点的PGAS运行因默认而同步,可能导致长期拖延。第二,高效的节点通信需要仔细汇总和管理短片信息。实验表明,PGAS程序的运作可以通过使用专门的低层图书馆(如Conveyors)来改善20多美元的时间,但是对方案编制生产率产生重大影响。许多现代编程语言(如Scala或Rust)和云计算界内部的动作模式越来越受欢迎。在本文中,我们为PGAS运行时间引入了新的编程系统,所有远程业务都通过使用基于行为者的编程系统而违约,从而可以提高20多美元。在这一方法中,方案与高层次的图书馆(例如Conveylers)相比,其运作模式与高水平的运行方式并不令人担心。

0

相关内容

Performer

【因果人工智能系统】106页ppt，Causal AI for Systems

专知会员服务

97+阅读 · 2021年8月28日

熊辉等首篇「深度学习图异常检测」综述论文，176篇文献全面概述GAD技术

熊辉等首篇「深度学习图异常检测」综述论文，176篇文献全面概述GAD技术

专知会员服务

83+阅读 · 2021年6月23日

【XAUTOML】可解释自动机器学习，27页ppt

【XAUTOML】可解释自动机器学习，27页ppt

专知会员服务

65+阅读 · 2021年4月23日

【2020新书】Web应用安全，331页pdf

【2020新书】Web应用安全，331页pdf

专知会员服务

25+阅读 · 2020年10月24日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【新书】Python编程基础，669页pdf

【新书】Python编程基础，669页pdf

专知会员服务

196+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

计算机类 | PLDI 2020等国际会议信息6条

计算机类 | PLDI 2020等国际会议信息6条

Call4Papers

3+阅读 · 2019年7月8日

计算机 | USENIX Security 2020等国际会议信息5条

计算机 | USENIX Security 2020等国际会议信息5条

Call4Papers

7+阅读 · 2019年4月25日

计算机 | CCF推荐期刊专刊信息5条

计算机 | CCF推荐期刊专刊信息5条

Call4Papers

3+阅读 · 2019年4月10日

人工智能 | SCI期刊专刊信息3条

人工智能 | SCI期刊专刊信息3条

Call4Papers

5+阅读 · 2019年1月10日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

大数据 | 顶级SCI期刊专刊/国际会议信息7条

大数据 | 顶级SCI期刊专刊/国际会议信息7条

Call4Papers

10+阅读 · 2018年12月29日

Ray RLlib: Scalable 降龙十八掌

Ray RLlib: Scalable 降龙十八掌

CreateAMind

9+阅读 · 2018年12月28日

人工智能 | PRICAI 2019等国际会议信息9条

人工智能 | PRICAI 2019等国际会议信息9条

Call4Papers

6+阅读 · 2018年12月13日

carla 学习笔记

carla 学习笔记

CreateAMind

9+阅读 · 2018年2月7日

【推荐】决策树/随机森林深入解析

【推荐】决策树/随机森林深入解析

机器学习研究会

5+阅读 · 2017年9月21日

Self-composable Programming

Arxiv

0+阅读 · 2021年9月14日

Refinable Function : An Object-oriented Approach to Procedure Modularity

Arxiv

0+阅读 · 2021年9月14日

Sparse PointPillars: Maintaining and Exploiting Input Sparsity to Improve Runtime on Embedded Systems

Arxiv

0+阅读 · 2021年9月14日

Scalable Scene Flow from Point Clouds in the Real World

Arxiv

0+阅读 · 2021年9月13日

On pareto-optimal fronts for diminishment of lane-changing impact in mixed traffic

Arxiv

0+阅读 · 2021年9月13日

Spike2Vec: An Efficient and Scalable Embedding Approach for COVID-19 Spike Sequences

Spike2Vec: An Efficient and Scalable Embedding Approach for COVID-19 Spike Sequences

Arxiv

0+阅读 · 2021年9月12日

DeepOPF: A Feasibility-Optimized Deep Neural Network Approach for AC Optimal Power Flow Problems

DeepOPF: A Feasibility-Optimized Deep Neural Network Approach for AC Optimal Power Flow Problems

Arxiv

0+阅读 · 2021年9月10日

DIRECT: A Differential Dynamic Programming Based Framework for Trajectory Generation

Arxiv

0+阅读 · 2021年9月10日

Analytical Process Scheduling Optimization for Heterogeneous Multi-core Systems

Arxiv

0+阅读 · 2021年9月10日

Linear SLAM: Linearising the SLAM Problems using Submap Joining

Linear SLAM: Linearising the SLAM Problems using Submap Joining

Arxiv

3+阅读 · 2018年9月18日

VIP会员

文章信息

相关主题

相关VIP内容

【因果人工智能系统】106页ppt，Causal AI for Systems

专知会员服务

97+阅读 · 2021年8月28日

熊辉等首篇「深度学习图异常检测」综述论文，176篇文献全面概述GAD技术

熊辉等首篇「深度学习图异常检测」综述论文，176篇文献全面概述GAD技术

专知会员服务

83+阅读 · 2021年6月23日

【XAUTOML】可解释自动机器学习，27页ppt

【XAUTOML】可解释自动机器学习，27页ppt

专知会员服务

65+阅读 · 2021年4月23日

【2020新书】Web应用安全，331页pdf

【2020新书】Web应用安全，331页pdf

专知会员服务

25+阅读 · 2020年10月24日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【新书】Python编程基础，669页pdf

【新书】Python编程基础，669页pdf

专知会员服务

196+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【ACL2025教程】大语言模型的护栏与安全性：对其应用的安全、可靠与可控引导

《实现协同自主：从人机协作到多智能体系统》最新190页

【ICML2025】SToFM：一种用于空间转录组学的多尺度基础模型

通信网络智能体白皮书V1.0，61页pdf

相关资讯

计算机类 | PLDI 2020等国际会议信息6条

计算机类 | PLDI 2020等国际会议信息6条

Call4Papers

3+阅读 · 2019年7月8日

计算机 | USENIX Security 2020等国际会议信息5条

计算机 | USENIX Security 2020等国际会议信息5条

Call4Papers

7+阅读 · 2019年4月25日

计算机 | CCF推荐期刊专刊信息5条

计算机 | CCF推荐期刊专刊信息5条

Call4Papers

3+阅读 · 2019年4月10日

人工智能 | SCI期刊专刊信息3条

人工智能 | SCI期刊专刊信息3条

Call4Papers

5+阅读 · 2019年1月10日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

大数据 | 顶级SCI期刊专刊/国际会议信息7条

大数据 | 顶级SCI期刊专刊/国际会议信息7条

Call4Papers

10+阅读 · 2018年12月29日

Ray RLlib: Scalable 降龙十八掌

Ray RLlib: Scalable 降龙十八掌

CreateAMind

9+阅读 · 2018年12月28日

人工智能 | PRICAI 2019等国际会议信息9条

人工智能 | PRICAI 2019等国际会议信息9条

Call4Papers

6+阅读 · 2018年12月13日

carla 学习笔记

carla 学习笔记

CreateAMind

9+阅读 · 2018年2月7日

【推荐】决策树/随机森林深入解析

【推荐】决策树/随机森林深入解析

机器学习研究会

5+阅读 · 2017年9月21日

相关论文

Self-composable Programming

Arxiv

0+阅读 · 2021年9月14日

Refinable Function : An Object-oriented Approach to Procedure Modularity

Arxiv

0+阅读 · 2021年9月14日

Sparse PointPillars: Maintaining and Exploiting Input Sparsity to Improve Runtime on Embedded Systems

Arxiv

0+阅读 · 2021年9月14日

Scalable Scene Flow from Point Clouds in the Real World

Arxiv

0+阅读 · 2021年9月13日

On pareto-optimal fronts for diminishment of lane-changing impact in mixed traffic

Arxiv

0+阅读 · 2021年9月13日

Spike2Vec: An Efficient and Scalable Embedding Approach for COVID-19 Spike Sequences

Spike2Vec: An Efficient and Scalable Embedding Approach for COVID-19 Spike Sequences

Arxiv

0+阅读 · 2021年9月12日

DeepOPF: A Feasibility-Optimized Deep Neural Network Approach for AC Optimal Power Flow Problems

DeepOPF: A Feasibility-Optimized Deep Neural Network Approach for AC Optimal Power Flow Problems

Arxiv

0+阅读 · 2021年9月10日

DIRECT: A Differential Dynamic Programming Based Framework for Trajectory Generation

Arxiv

0+阅读 · 2021年9月10日

Analytical Process Scheduling Optimization for Heterogeneous Multi-core Systems

Arxiv

0+阅读 · 2021年9月10日

Linear SLAM: Linearising the SLAM Problems using Submap Joining

Linear SLAM: Linearising the SLAM Problems using Submap Joining

Arxiv

3+阅读 · 2018年9月18日

微信扫码咨询专知VIP会员