非边协调图 (Non-Linear Coordination Graphs) - 专知论文

会员服务 ·

0

泛函 · 价值函数 · 图 · 优化器 · Learning ·

2022 年 10 月 26 日

Non-Linear Coordination Graphs

翻译：非边协调图

Yipeng Kang,Tonghan Wang,Xiaoran Wu,Qianlan Yang,Chongjie Zhang

from arxiv, Authors are listed in alphabetical order

Value decomposition multi-agent reinforcement learning methods learn the global value function as a mixing of each agent's individual utility functions. Coordination graphs (CGs) represent a higher-order decomposition by incorporating pairwise payoff functions and thus is supposed to have a more powerful representational capacity. However, CGs decompose the global value function linearly over local value functions, severely limiting the complexity of the value function class that can be represented. In this paper, we propose the first non-linear coordination graph by extending CG value decomposition beyond the linear case. One major challenge is to conduct greedy action selections in this new function class to which commonly adopted DCOP algorithms are no longer applicable. We study how to solve this problem when mixing networks with LeakyReLU activation are used. An enumeration method with a global optimality guarantee is proposed and motivates an efficient iterative optimization method with a local optimality guarantee. We find that our method can achieve superior performance on challenging multi-agent coordination tasks like MACO.

翻译：多剂强化的数值分解学习方法学习全球价值功能,将每种物剂的个别效用功能混合在一起。协调图表(CGs)通过纳入双向报酬功能,代表了更高层次的分解,因此应该具有更强大的代表能力。然而,CGs将全球价值函数的线性分解超过当地价值功能,严重限制了所代表价值功能类别的复杂性。在本文件中,我们提议第一个非线性协调图,将CG值分解范围扩大到线性案例之外。一个重大挑战是在这一新功能类别中进行贪婪的行动选择,而通常采用的DCOP算法已不再适用。我们研究如何在将网络与LekyReLU的激活混合时解决这一问题。提出了具有全球最佳性保证的计数方法,并激励一种具有地方最佳性保证的高效迭代优化方法。我们发现,我们的方法可以在挑战性多剂协调任务(如MACO)上取得优异性业绩。

0

相关内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

53+阅读 · 2020年9月7日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

MARVELD1基因调控肝细胞癌介入治疗的机制研究

国家自然科学基金

0+阅读 · 2016年12月31日

基底型乳腺癌干细胞信号传导网络结构建模

国家自然科学基金

0+阅读 · 2014年12月31日

EB病毒ncRNA在Burkitt淋巴瘤发病中的作用及机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Kindlin-2在上皮性卵巢癌中调节雌激素受体α表达的作用及其机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

莱菔硫烷在脊髓损伤后抑制神经细胞Pyroptosis的作用及其机制的研究

国家自然科学基金

0+阅读 · 2013年12月31日

IL-32/Integrins/FAK通路在肝纤维化形成中的作用研究

国家自然科学基金

0+阅读 · 2013年12月31日

lincRNA-ETS1-2上调癌基因ETS-1表达促进雄激素非依赖性前列腺癌演进的机制

国家自然科学基金

0+阅读 · 2012年12月31日

HBx蛋白调控miRNA介导乙型肝炎病毒天然免疫逃逸的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

脱－γ－羧基凝血酶原(Des-γ-carboxyl prothrombin DCP)促进肝癌恶性增殖与转移作用研究

国家自然科学基金

0+阅读 · 2011年12月31日

miR-130b在肝细胞癌发病中的作用及其表达调控机制

国家自然科学基金

0+阅读 · 2011年12月31日

Learning Dynamical Systems From Invariant Measures

Arxiv

0+阅读 · 2023年1月12日

Approximate Information States for Worst-Case Control and Learning in Uncertain Systems

Arxiv

0+阅读 · 2023年1月12日

Policy Mirror Descent for Regularized Reinforcement Learning: A Generalized Framework with Linear Convergence

Arxiv

0+阅读 · 2023年1月10日

Structural risk minimization for quantum linear classifiers

Arxiv

0+阅读 · 2023年1月10日

Concentration of measure and generalized product of random vectors with an application to Hanson-Wright-like inequalities

Arxiv

0+阅读 · 2023年1月10日

Survey on Graph Neural Network Acceleration: An Algorithmic Perspective

Arxiv

12+阅读 · 2022年2月10日

Active Learning for Domain Adaptation: An Energy-based Approach

Arxiv

13+阅读 · 2021年12月2日

Graph Neural Networks for Recommender Systems: Challenges, Methods, and Directions

Arxiv

31+阅读 · 2021年9月27日

Temporal Graph Networks for Deep Learning on Dynamic Graphs

Arxiv

37+阅读 · 2020年10月9日

Learning to Propagate for Graph Meta-Learning

Arxiv

14+阅读 · 2019年9月11日

VIP会员

文章信息

相关主题

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

53+阅读 · 2020年9月7日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《美空军条令出版物：战略打击》最新条令

《高能激光武器》22页slides

军事前沿模型

《面向小型无人机或无人飞行器的创新雷达探测与人工智能分类技术》263页

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Learning Dynamical Systems From Invariant Measures

Arxiv

0+阅读 · 2023年1月12日

Approximate Information States for Worst-Case Control and Learning in Uncertain Systems

Arxiv

0+阅读 · 2023年1月12日

Policy Mirror Descent for Regularized Reinforcement Learning: A Generalized Framework with Linear Convergence

Arxiv

0+阅读 · 2023年1月10日

Structural risk minimization for quantum linear classifiers

Arxiv

0+阅读 · 2023年1月10日

Concentration of measure and generalized product of random vectors with an application to Hanson-Wright-like inequalities

Arxiv

0+阅读 · 2023年1月10日

Survey on Graph Neural Network Acceleration: An Algorithmic Perspective

Arxiv

12+阅读 · 2022年2月10日

Active Learning for Domain Adaptation: An Energy-based Approach

Arxiv

13+阅读 · 2021年12月2日

Graph Neural Networks for Recommender Systems: Challenges, Methods, and Directions

Arxiv

31+阅读 · 2021年9月27日

Temporal Graph Networks for Deep Learning on Dynamic Graphs

Arxiv

37+阅读 · 2020年10月9日

Learning to Propagate for Graph Meta-Learning

Arxiv

14+阅读 · 2019年9月11日

相关基金

MARVELD1基因调控肝细胞癌介入治疗的机制研究

国家自然科学基金

0+阅读 · 2016年12月31日

基底型乳腺癌干细胞信号传导网络结构建模

国家自然科学基金

0+阅读 · 2014年12月31日

EB病毒ncRNA在Burkitt淋巴瘤发病中的作用及机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Kindlin-2在上皮性卵巢癌中调节雌激素受体α表达的作用及其机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

莱菔硫烷在脊髓损伤后抑制神经细胞Pyroptosis的作用及其机制的研究

国家自然科学基金

0+阅读 · 2013年12月31日

IL-32/Integrins/FAK通路在肝纤维化形成中的作用研究

国家自然科学基金

0+阅读 · 2013年12月31日

lincRNA-ETS1-2上调癌基因ETS-1表达促进雄激素非依赖性前列腺癌演进的机制

国家自然科学基金

0+阅读 · 2012年12月31日

HBx蛋白调控miRNA介导乙型肝炎病毒天然免疫逃逸的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

脱－γ－羧基凝血酶原(Des-γ-carboxyl prothrombin DCP)促进肝癌恶性增殖与转移作用研究

国家自然科学基金

0+阅读 · 2011年12月31日

miR-130b在肝细胞癌发病中的作用及其表达调控机制

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员