Diversifying Message Aggregation in Multi-Agent Communication via Normalized Tensor Nuclear Norm Regularization - 专知论文


Tensor · Nuclear norm · Tensor nuclear norm · Regularization term · Graph attention network

August 10, 2022

Diversifying Message Aggregation in Multi-Agent Communication via Normalized Tensor Nuclear Norm Regularization


Yuanzhao Zhai, Kele Xu, Bo Ding, Dawei Feng, Zijian Gao, Huaimin Wang

From arXiv, 11 pages, 9 figures

Message aggregation is a key component of communication in multi-agent reinforcement learning (Comm-MARL). Graph attention networks (GAT) have recently become prevalent in Comm-MARL: agents are represented as nodes, and messages are aggregated through weighted message passing. While successful, GAT can lead to homogeneous message-aggregation strategies, in which a "core" agent excessively influences other agents' behaviors, and this can severely limit multi-agent coordination. To address this challenge, we first study the adjacency tensor of the communication graph and show that the homogeneity of message aggregation can be measured by the normalized tensor rank. Since rank optimization is known to be NP-hard, we define a new nuclear norm, a convex surrogate of the normalized tensor rank, to replace the rank. Leveraging this norm, we further propose a plug-and-play regularizer on the adjacency tensor, named Normalized Tensor Nuclear Norm Regularization (NTNNR), which actively enriches the diversity of message aggregation. We extensively evaluate GAT with the proposed regularizer in both cooperative and mixed cooperative-competitive scenarios. The results demonstrate that aggregating messages with NTNNR-enhanced GAT improves training efficiency and achieves higher asymptotic performance than existing message aggregation methods. When NTNNR is applied to existing graph-attention Comm-MARL methods, we also observe significant performance improvements on the StarCraft II micromanagement benchmarks.
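To make the idea of a nuclear-norm regularizer on the adjacency tensor concrete, the following is a minimal sketch and not the authors' implementation. The abstract does not give the exact definition of the normalized tensor nuclear norm, so everything here is an assumption for illustration: the adjacency tensor is taken to be a stack of K row-stochastic N x N attention matrices, the norm is the mean matrix nuclear norm of the slices divided by N, and the names `normalized_nuclear_norm`, `regularized_loss`, and `weight` are hypothetical.

```python
import numpy as np


def normalized_nuclear_norm(adjacency: np.ndarray) -> float:
    """Mean nuclear norm of the slices of a (K, N, N) tensor, divided by N.

    Assumption: this normalization is chosen for illustration only; the
    paper's exact definition is not stated in the abstract.
    """
    k, n, m = adjacency.shape
    if n != m:
        raise ValueError("each slice must be a square N x N matrix")
    # np.linalg.svd accepts stacked matrices; result has shape (K, N).
    singular_values = np.linalg.svd(adjacency, compute_uv=False)
    # The nuclear norm of a matrix is the sum of its singular values.
    nuclear_per_slice = singular_values.sum(axis=-1)
    return float(nuclear_per_slice.mean() / n)


def regularized_loss(task_loss: float, adjacency: np.ndarray,
                     weight: float = 0.1) -> float:
    """Subtract the norm so that minimizing the loss rewards diversity."""
    return task_loss - weight * normalized_nuclear_norm(adjacency)


n_agents = 4

# Homogeneous case: every agent attends only to "core" agent 0 (rank 1).
core = np.zeros((1, n_agents, n_agents))
core[0, :, 0] = 1.0

# Diverse case: every agent attends to a different agent (full rank).
diverse = np.eye(n_agents)[None, :, :]

print(normalized_nuclear_norm(core))     # approximately 0.5
print(normalized_nuclear_norm(diverse))  # approximately 1.0
```

Under these assumptions the rank-1 "core agent" pattern scores lower than the full-rank pattern, which is the behavior a diversity regularizer needs. A training setup would compute the same quantity inside an automatic-differentiation framework so that gradients reach the attention weights.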

