rFaaS: RDMA-Enabled FaaS Platform for Serverless High-Performance Computing


June 25, 2021


Marcin Copik, Konstantin Taranov, Alexandru Calotoiu, Torsten Hoefler

The rigid MPI programming model and batch scheduling dominate high-performance computing. While clouds brought new levels of elasticity into the world of computing, supercomputers still suffer from low resource utilization rates. To enhance supercomputing clusters with the benefits of serverless computing, a modern cloud programming paradigm for pay-as-you-go execution of stateless functions, we present rFaaS, the first RDMA-aware Function-as-a-Service (FaaS) platform. With hot invocations and decentralized function placement, we overcome the major performance limitations of FaaS systems and provide low-latency remote invocations in multi-tenant environments. We evaluate the new serverless system through a series of microbenchmarks and show that remote functions execute with negligible performance overheads. We demonstrate how serverless computing can bring elastic resource management into MPI-based high-performance applications. Overall, our results show that MPI applications can benefit from modern cloud programming paradigms to guarantee high performance at lower resource costs.

