虚拟链接:一个可缩放的多生产器,多客户信息信息跨核心通信的队列结构 (Virtual-Link: A Scalable Multi-Producer, Multi-Consumer Message Queue Architecture for Cross-Core Communication) - 专知论文

会员服务 ·

0

可约的 · Performer · state-of-the-art · 流 · FAST ·

2021 年 1 月 19 日

Virtual-Link: A Scalable Multi-Producer, Multi-Consumer Message Queue Architecture for Cross-Core Communication

翻译：虚拟链接:一个可缩放的多生产器,多客户信息信息跨核心通信的队列结构

Qinzhe Wu,Jonathan Beard,Ashen Ekanayake,Andreas Gerstlauer,Lizy K. John

Cross-core communication is increasingly a bottleneck as the number of processing elements increase per system-on-chip. Typical hardware solutions to cross-core communication are often inflexible; while software solutions are flexible, they have performance scaling limitations. A key problem, as we will show, is that of shared state in software-based message queue mechanisms. This paper proposes Virtual-Link (VL), a novel light-weight communication mechanism with hardware support to facilitate M:N lock-free data movement. VL reduces the amount of coherent shared state, which is a bottleneck for many approaches, to zero. VL provides further latency benefit by keeping data on the fast path (i.e., within the on-chip interconnect). VL enables directed cache-injection (stashing) between PEs on the coherence bus, reducing the latency for core-to-core communication. VL is particularly effective for fine-grain tasks on streaming data. Evaluation on a full system simulator with 7 benchmarks shows that VL achieves a 2.09x speedup over state-of-the-art software-based communication mechanisms, while reducing memory traffic by 61%.

翻译：跨核心通信日益成为瓶颈,因为处理元素的数量会增加每个系统-芯片的处理元素数量。跨核心通信的典型硬件解决方案往往不灵活;软件解决方案虽然灵活,但有绩效缩放限制。我们将表明,一个关键问题是基于软件的信息队列机制中的共享状态。本文提出虚拟链接(VL),这是一个具有硬件支持的新颖的轻量通信机制,有硬件支持,有利于M:N无锁数据移动。VL将一致共享状态的数量减少到零。VL通过将数据保存在快速路径上(即在芯片互联中)提供进一步的延缓效益。VL能够让个人在基于软件的连接中进行定向缓冲(缓冲),降低核心-核心通信的惯性。VL对于跟踪数据的细微任务特别有效。对7个基准的全系统模拟器的评估显示,VL在降低状态-节流读软件的通信机制的同时,实现了2.09x速度超过状态-61 %的存储器通信机制。

0

相关内容

可约的

【经典书】Linux UNIX系统编程手册，1554页pdf

【经典书】Linux UNIX系统编程手册，1554页pdf

专知会员服务

48+阅读 · 2021年2月20日

【干货书】Python程序员编程，810页pdf，Python® for Programmers

【干货书】Python程序员编程，810页pdf，Python® for Programmers

专知会员服务

62+阅读 · 2020年8月6日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【Manning新书】微服务安全实战，616页pdf，Microservices Security in Action

【Manning新书】微服务安全实战，616页pdf，Microservices Security in Action

专知会员服务

46+阅读 · 2020年7月22日

商业数据分析，39页ppt

商业数据分析，39页ppt

专知会员服务

165+阅读 · 2020年6月2日

【阿里巴巴达摩院】TResNet: 高性能的GPU专用架构，GPU-Dedicated Architecture

【阿里巴巴达摩院】TResNet: 高性能的GPU专用架构，GPU-Dedicated Architecture

专知会员服务

33+阅读 · 2020年4月1日

【Google】神经架构搜索（Neural Architecture Search and Beyond），Barret Zoph

【Google】神经架构搜索（Neural Architecture Search and Beyond），Barret Zoph

专知会员服务

31+阅读 · 2019年11月25日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

【电子书推荐】Data Science with Python and Dask

【电子书推荐】Data Science with Python and Dask

专知会员服务

44+阅读 · 2019年6月1日

LibRec 精选：AutoML for Contextual Bandits

LibRec 精选：AutoML for Contextual Bandits

LibRec智能推荐

7+阅读 · 2019年9月19日

CCF推荐 | 国际会议信息6条

CCF推荐 | 国际会议信息6条

Call4Papers

9+阅读 · 2019年8月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

LibRec 精选：位置感知的长序列会话推荐

LibRec 精选：位置感知的长序列会话推荐

LibRec智能推荐

3+阅读 · 2019年5月17日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

LibRec 精选：推荐系统的论文与源码

LibRec 精选：推荐系统的论文与源码

LibRec智能推荐

14+阅读 · 2018年11月29日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

carla 学习笔记

carla 学习笔记

CreateAMind

9+阅读 · 2018年2月7日

Understanding the effect of hyperparameter optimization on machine learning models for structure design problems

Arxiv

0+阅读 · 2021年3月15日

A novel approach for the efficient modeling of material dissolution in electrochemical machining

Arxiv

0+阅读 · 2021年3月15日

From Nano-Communications to Body Area Networks: A Perspective on Truly Personal Communications

Arxiv

0+阅读 · 2021年3月12日

Robofleet: Secure Open Source Communication and Management for Fleets of Autonomous Robots

Arxiv

0+阅读 · 2021年3月11日

Scalable Bayesian Inverse Reinforcement Learning

Arxiv

0+阅读 · 2021年3月11日

Neural Architecture Generator Optimization

Arxiv

6+阅读 · 2020年10月8日

On Layer Normalization in the Transformer Architecture

Arxiv

4+阅读 · 2020年2月12日

IRLAS: Inverse Reinforcement Learning for Architecture Search

IRLAS: Inverse Reinforcement Learning for Architecture Search

Arxiv

4+阅读 · 2018年12月14日

Latent Multi-task Architecture Learning

Arxiv

3+阅读 · 2018年11月19日

Efficient and Effective $L_0$ Feature Selection

Efficient and Effective $L_0$ Feature Selection

Arxiv

5+阅读 · 2018年8月7日

VIP会员

文章信息

相关主题

state-of-the-art

相关VIP内容

【经典书】Linux UNIX系统编程手册，1554页pdf

【经典书】Linux UNIX系统编程手册，1554页pdf

专知会员服务

48+阅读 · 2021年2月20日

【干货书】Python程序员编程，810页pdf，Python® for Programmers

【干货书】Python程序员编程，810页pdf，Python® for Programmers

专知会员服务

62+阅读 · 2020年8月6日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【Manning新书】微服务安全实战，616页pdf，Microservices Security in Action

【Manning新书】微服务安全实战，616页pdf，Microservices Security in Action

专知会员服务

46+阅读 · 2020年7月22日

商业数据分析，39页ppt

商业数据分析，39页ppt

专知会员服务

165+阅读 · 2020年6月2日

【阿里巴巴达摩院】TResNet: 高性能的GPU专用架构，GPU-Dedicated Architecture

【阿里巴巴达摩院】TResNet: 高性能的GPU专用架构，GPU-Dedicated Architecture

专知会员服务

33+阅读 · 2020年4月1日

【Google】神经架构搜索（Neural Architecture Search and Beyond），Barret Zoph

【Google】神经架构搜索（Neural Architecture Search and Beyond），Barret Zoph

专知会员服务

31+阅读 · 2019年11月25日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

【电子书推荐】Data Science with Python and Dask

【电子书推荐】Data Science with Python and Dask

专知会员服务

44+阅读 · 2019年6月1日

热门VIP内容

开通专知VIP会员享更多权益服务

隐身自主无人水下航行器技术如何变革水下作战并重塑海军竞争

《俄乌战争中的无人系统：新的战争方式与新兴趋势——来自前线的印象》报告

《海上自主水面船舶远程操作中心：安全可持续运行的多维度分析》

相关资讯

LibRec 精选：AutoML for Contextual Bandits

LibRec 精选：AutoML for Contextual Bandits

LibRec智能推荐

7+阅读 · 2019年9月19日

CCF推荐 | 国际会议信息6条

CCF推荐 | 国际会议信息6条

Call4Papers

9+阅读 · 2019年8月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

LibRec 精选：位置感知的长序列会话推荐

LibRec 精选：位置感知的长序列会话推荐

LibRec智能推荐

3+阅读 · 2019年5月17日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

LibRec 精选：推荐系统的论文与源码

LibRec 精选：推荐系统的论文与源码

LibRec智能推荐

14+阅读 · 2018年11月29日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

carla 学习笔记

carla 学习笔记

CreateAMind

9+阅读 · 2018年2月7日

相关论文

Understanding the effect of hyperparameter optimization on machine learning models for structure design problems

Arxiv

0+阅读 · 2021年3月15日

A novel approach for the efficient modeling of material dissolution in electrochemical machining

Arxiv

0+阅读 · 2021年3月15日

From Nano-Communications to Body Area Networks: A Perspective on Truly Personal Communications

Arxiv

0+阅读 · 2021年3月12日

Robofleet: Secure Open Source Communication and Management for Fleets of Autonomous Robots

Arxiv

0+阅读 · 2021年3月11日

Scalable Bayesian Inverse Reinforcement Learning

Arxiv

0+阅读 · 2021年3月11日

Neural Architecture Generator Optimization

Arxiv

6+阅读 · 2020年10月8日

On Layer Normalization in the Transformer Architecture

Arxiv

4+阅读 · 2020年2月12日

IRLAS: Inverse Reinforcement Learning for Architecture Search

IRLAS: Inverse Reinforcement Learning for Architecture Search

Arxiv

4+阅读 · 2018年12月14日

Latent Multi-task Architecture Learning

Arxiv

3+阅读 · 2018年11月19日

Efficient and Effective $L_0$ Feature Selection

Efficient and Effective $L_0$ Feature Selection

Arxiv

5+阅读 · 2018年8月7日

微信扫码咨询专知VIP会员