To satisfy their timing constraints, modern real-time applications require massively parallel accelerators such as General-Purpose Graphics Processing Units (GPGPUs). Generation after generation, the number of computing clusters available in new GPU architectures steadily increases; investigating suitable scheduling approaches is therefore mandatory. Such approaches map different, concurrent compute kernels onto the GPU computing clusters by grouping the clusters into schedulable partitions. In this paper we propose novel techniques to define GPU partitions; these allow us to devise task-to-partition allocation mechanisms in which tasks are GPU compute kernels with different timing requirements. The allocation mechanisms account for the interference that GPU kernels experience when running in overlapping time windows, and we also present an effective and simple way to quantify the magnitude of this interference. We demonstrate the efficiency of the proposed approaches against classical techniques that treat the GPU as a single, non-partitionable resource.
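To make the allocation idea concrete, the following is a minimal sketch of an interference-aware, first-fit assignment of kernels to cluster partitions. The utilization model, the multiplicative interference factor, the partition sizes, and all kernel parameters are illustrative assumptions and not the mechanisms proposed in the paper.

```python
# Hypothetical sketch: interference-aware first-fit allocation of GPU kernels
# (tasks) to cluster partitions. The utilization model and the interference
# factor below are assumptions for illustration only.
from dataclasses import dataclass, field
from typing import List, Optional

@dataclass
class Kernel:
    name: str
    wcet: float      # worst-case execution time when running alone (ms)
    period: float    # activation period / deadline (ms)

    @property
    def utilization(self) -> float:
        return self.wcet / self.period

@dataclass
class Partition:
    clusters: int                           # GPU computing clusters in this partition
    kernels: List[Kernel] = field(default_factory=list)

    def load(self, interference: float) -> float:
        # Inflate each kernel's utilization by a factor that grows with the
        # number of co-located kernels (assumed interference model).
        n = len(self.kernels)
        return sum(k.utilization * (1.0 + interference * max(n - 1, 0))
                   for k in self.kernels)

def first_fit(kernels: List[Kernel], partitions: List[Partition],
              interference: float = 0.1, bound: float = 1.0) -> Optional[List[Partition]]:
    """Assign kernels (largest utilization first) to the first partition whose
    inflated load stays within the schedulability bound; None if infeasible."""
    for k in sorted(kernels, key=lambda k: k.utilization, reverse=True):
        for p in partitions:
            p.kernels.append(k)
            if p.load(interference) <= bound:
                break
            p.kernels.pop()
        else:
            return None  # no feasible partition found for this kernel
    return partitions

if __name__ == "__main__":
    parts = [Partition(clusters=4), Partition(clusters=4)]
    tasks = [Kernel("detect", 4.0, 10.0), Kernel("track", 2.0, 10.0),
             Kernel("plan", 5.0, 20.0)]
    result = first_fit(tasks, parts)
    if result:
        for i, p in enumerate(result):
            print(f"partition {i}: {[k.name for k in p.kernels]}")
    else:
        print("infeasible under the assumed interference model")
```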