与Tima Inform Inforcation Bandwidth有限公司下的图形信息瓶颈下的多剂通信(立场文件) (Multi-agent Communication with Graph Information Bottleneck under Limited Bandwidth (a position paper)) - 专知论文

会员服务 ·

0

INFORMS · 图 · 完全图 · Performer · Extensibility ·

2021 年 12 月 20 日

Multi-agent Communication with Graph Information Bottleneck under Limited Bandwidth (a position paper)

翻译：与Tima Inform Inforcation Bandwidth有限公司下的图形信息瓶颈下的多剂通信(立场文件)

Qi Tian,Kun Kuang,Baoxiang Wang,Furui Liu,Fei Wu

Recent studies have shown that introducing communication between agents can significantly improve overall performance in cooperative Multi-agent reinforcement learning (MARL). In many real-world scenarios, communication can be expensive and the bandwidth of the multi-agent system is subject to certain constraints. Redundant messages who occupy the communication resources can block the transmission of informative messages and thus jeopardize the performance. In this paper, we aim to learn the minimal sufficient communication messages. First, we initiate the communication between agents by a complete graph. Then we introduce the graph information bottleneck (GIB) principle into this complete graph and derive the optimization over graph structures. Based on the optimization, a novel multi-agent communication module, called CommGIB, is proposed, which effectively compresses the structure information and node information in the communication graph to deal with bandwidth-constrained settings. Extensive experiments in Traffic Control and StanCraft II are conducted. The results indicate that the proposed methods can achieve better performance in bandwidth-restricted settings compared with state-of-the-art algorithms, with especially large margins in large-scale multi-agent tasks.

翻译：最近的研究显示,在多剂强化合作学习(MARL)中,采用代理商之间的通信可以大大改善合作性多剂强化学习的总体绩效。在许多现实世界情景中,通信费用昂贵,多剂系统的带宽受到某些限制。占用通信资源的多余信息可以阻断信息传递,从而危及性能。在本文中,我们的目标是通过一个完整的图表来学习最低限度的充分通信信息。首先,我们通过一个完整的图表来启动代理商之间的通信。然后,我们在这个完整的图表中引入图形信息瓶颈原则,并在图形结构上进行优化。在优化的基础上,提出了一个新的多剂通信模块,称为CommGIB,它有效地压缩结构信息和通信图中的节点信息,以应对带宽限制的环境。在交通控制和斯坦克拉夫二号上进行了广泛的实验。结果显示,拟议的方法可以在带宽度限制环境中实现更好的性能,与最先进的算法相比,在大型多剂任务中特别有较大的利润。

0

相关内容

INFORMS

《计算机信息》杂志发表高质量的论文，扩大了运筹学和计算的范围，寻求有关理论、方法、实验、系统和应用方面的原创研究论文、新颖的调查和教程论文，以及描述新的和有用的软件工具的论文。官网链接：https://pubsonline.informs.org/journal/ijoc

【ICML2021】异质风险最小化，Heterogeneous Risk Minimization

专知会员服务

16+阅读 · 2021年5月21日

【WWW2021】REST:关系事件驱动的股票趋势预测

【WWW2021】REST:关系事件驱动的股票趋势预测

专知会员服务

34+阅读 · 2021年3月9日

属性异质信息网络上的半监督双聚类

专知会员服务

30+阅读 · 2021年2月17日

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

专知会员服务

54+阅读 · 2021年1月20日

【论文推荐】层次知识图谱，Hierarchical Knowledge Graphs: A Novel Information Representation for Exploratory Search Tasks

【论文推荐】层次知识图谱，Hierarchical Knowledge Graphs: A Novel Information Representation for Exploratory Search Tasks

专知会员服务

49+阅读 · 2020年5月26日

具有组合核的图神经网络，Graph Neural Networks with Composite Kernels

具有组合核的图神经网络，Graph Neural Networks with Composite Kernels

专知会员服务

59+阅读 · 2020年5月20日

Python分布式计算，171页pdf，Distributed Computing with Python

Python分布式计算，171页pdf，Distributed Computing with Python

专知会员服务

108+阅读 · 2020年5月3日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Call for Participation: Shared Tasks in NLPCC 2019

Call for Participation: Shared Tasks in NLPCC 2019

中国计算机学会

5+阅读 · 2019年3月22日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

Call4Papers

5+阅读 · 2018年12月7日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

人工智能 | 国际会议截稿信息9条

人工智能 | 国际会议截稿信息9条

Call4Papers

4+阅读 · 2018年3月13日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

【今日新增】IEEE Trans.专刊截稿信息8条

【今日新增】IEEE Trans.专刊截稿信息8条

Call4Papers

7+阅读 · 2017年6月29日

Permutation-equivariant and Proximity-aware Graph Neural Networks with Stochastic Message Passing

Arxiv

0+阅读 · 2022年2月22日

A Decentralized Communication Framework based on Dual-Level Recurrence for Multi-Agent Reinforcement Learning

Arxiv

0+阅读 · 2022年2月22日

A Globally Convergent Evolutionary Strategy for Stochastic Constrained Optimization with Applications to Reinforcement Learning

Arxiv

0+阅读 · 2022年2月21日

Multi-Agent Reinforcement Learning for Network Selection and Resource Allocation in Heterogeneous multi-RAT Networks

Multi-Agent Reinforcement Learning for Network Selection and Resource Allocation in Heterogeneous multi-RAT Networks

Arxiv

0+阅读 · 2022年2月21日

Task-oriented Scheduling for Networked Control Systems: An Age of Information-Aware Implementation on Software-defined Radios

Arxiv

0+阅读 · 2022年2月21日

Learning Multi-agent Action Coordination via Electing First-move Agent

Arxiv

0+阅读 · 2022年2月19日

Multi-Agent Cooperative Bidding Games for Multi-Objective Optimization in e-Commercial Sponsored Search

Arxiv

12+阅读 · 2021年6月8日

Graph Information Bottleneck for Subgraph Recognition

Arxiv

8+阅读 · 2020年10月12日

Heterogeneous Relational Reasoning in Knowledge Graphs with Reinforcement Learning

Heterogeneous Relational Reasoning in Knowledge Graphs with Reinforcement Learning

Arxiv

10+阅读 · 2020年3月12日

Accelerated Randomized Coordinate Descent Algorithms for Stochastic Optimization and Online Learning

Arxiv

9+阅读 · 2018年7月16日

VIP会员

文章信息

相关主题

相关VIP内容

【ICML2021】异质风险最小化，Heterogeneous Risk Minimization

专知会员服务

16+阅读 · 2021年5月21日

【WWW2021】REST:关系事件驱动的股票趋势预测

【WWW2021】REST:关系事件驱动的股票趋势预测

专知会员服务

34+阅读 · 2021年3月9日

属性异质信息网络上的半监督双聚类

专知会员服务

30+阅读 · 2021年2月17日

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

专知会员服务

54+阅读 · 2021年1月20日

【论文推荐】层次知识图谱，Hierarchical Knowledge Graphs: A Novel Information Representation for Exploratory Search Tasks

【论文推荐】层次知识图谱，Hierarchical Knowledge Graphs: A Novel Information Representation for Exploratory Search Tasks

专知会员服务

49+阅读 · 2020年5月26日

具有组合核的图神经网络，Graph Neural Networks with Composite Kernels

具有组合核的图神经网络，Graph Neural Networks with Composite Kernels

专知会员服务

59+阅读 · 2020年5月20日

Python分布式计算，171页pdf，Distributed Computing with Python

Python分布式计算，171页pdf，Distributed Computing with Python

专知会员服务

108+阅读 · 2020年5月3日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【新书】《知识图谱与大语言模型的协同应用》，544页pdf

军事通信系统：安全行动的支柱

《缓解大语言模型（LLMs）幻觉：面向应用的检索增强生成（RAG）、推理与智能体系统综述》

【新书】机器学习系统，2620页pdf

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Call for Participation: Shared Tasks in NLPCC 2019

Call for Participation: Shared Tasks in NLPCC 2019

中国计算机学会

5+阅读 · 2019年3月22日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

Call4Papers

5+阅读 · 2018年12月7日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

人工智能 | 国际会议截稿信息9条

人工智能 | 国际会议截稿信息9条

Call4Papers

4+阅读 · 2018年3月13日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

【今日新增】IEEE Trans.专刊截稿信息8条

【今日新增】IEEE Trans.专刊截稿信息8条

Call4Papers

7+阅读 · 2017年6月29日

相关论文

Permutation-equivariant and Proximity-aware Graph Neural Networks with Stochastic Message Passing

Arxiv

0+阅读 · 2022年2月22日

A Decentralized Communication Framework based on Dual-Level Recurrence for Multi-Agent Reinforcement Learning

Arxiv

0+阅读 · 2022年2月22日

A Globally Convergent Evolutionary Strategy for Stochastic Constrained Optimization with Applications to Reinforcement Learning

Arxiv

0+阅读 · 2022年2月21日

Multi-Agent Reinforcement Learning for Network Selection and Resource Allocation in Heterogeneous multi-RAT Networks

Multi-Agent Reinforcement Learning for Network Selection and Resource Allocation in Heterogeneous multi-RAT Networks

Arxiv

0+阅读 · 2022年2月21日

Task-oriented Scheduling for Networked Control Systems: An Age of Information-Aware Implementation on Software-defined Radios

Arxiv

0+阅读 · 2022年2月21日

Learning Multi-agent Action Coordination via Electing First-move Agent

Arxiv

0+阅读 · 2022年2月19日

Multi-Agent Cooperative Bidding Games for Multi-Objective Optimization in e-Commercial Sponsored Search

Arxiv

12+阅读 · 2021年6月8日

Graph Information Bottleneck for Subgraph Recognition

Arxiv

8+阅读 · 2020年10月12日

Heterogeneous Relational Reasoning in Knowledge Graphs with Reinforcement Learning

Heterogeneous Relational Reasoning in Knowledge Graphs with Reinforcement Learning

Arxiv

10+阅读 · 2020年3月12日

Accelerated Randomized Coordinate Descent Algorithms for Stochastic Optimization and Online Learning

Arxiv

9+阅读 · 2018年7月16日

微信扫码咨询专知VIP会员