网络化Markov潜在运动会中本地化行动者-评论家的趋同率</s> (Convergence Rates for Localized Actor-Critic in Networked Markov Potential Games) - 专知论文

会员服务 ·

0

Agent · Markov · Networking · 泛函 · 近似 ·

2023 年 3 月 8 日

Convergence Rates for Localized Actor-Critic in Networked Markov Potential Games

翻译：网络化Markov潜在运动会中本地化行动者-评论家的趋同率

Zhaoyi Zhou,Zaiwei Chen,Yiheng Lin,Adam Wierman

We introduce a class of networked Markov potential games where agents are associated with nodes in a network. Each agent has its own local potential function, and the reward of each agent depends only on the states and actions of agents within a $\kappa$-hop neighborhood. In this context, we propose a localized actor-critic algorithm. The algorithm is scalable since each agent uses only local information and does not need access to the global state. Further, the algorithm overcomes the curse of dimensionality through the use of function approximation. Our main results provide finite-sample guarantees up to a localization error and a function approximation error. Specifically, we achieve an $\tilde{\mathcal{O}}(\epsilon^{-4})$ sample complexity measured by the averaged Nash regret. This is the first finite-sample bound for multi-agent competitive games that does not depend on the number of agents.

翻译：我们引入了一组网络化的Markov潜在游戏, 其中代理商与网络中的节点相关联。每个代理商都有自己的本地潜在功能, 每个代理商的奖赏只取决于在$\kappa$-hop 邻里代理商的状态和行动。在这方面, 我们提出一个本地化的演算算法。算法是可以缩放的, 因为每个代理商只使用本地信息, 不需要进入全球状态。此外, 算法通过使用函数近似来克服了维度的诅咒。我们的主要结果为本地化错误和函数近似错误提供了有限的保证。具体地说, 我们实现了按平均纳什遗憾度测量的 $\ tilde\ mathcal{O{(\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\</s>

0

相关内容

Agent

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

神经系统seipin缺失诱发精神迟滞的分子机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

长链非编码RNA CAR intergenic 10在细胞衰老中的作用和机制

国家自然科学基金

1+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

LCA视角下宏观建筑碳排放结构特性、演化机理及区域化减排机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Haccpper环境中不锈钢表面活性与电化学噪声特征研究

国家自然科学基金

0+阅读 · 2012年12月31日

交通污染影响DNA甲基化及组蛋白乙酰化的表观遗传机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

Skp2-p27信号通路在卵巢早衰发病中的作用机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于景观格局演变的鄱阳湖典型流域水环境响应及其优化模型研究

国家自然科学基金

0+阅读 · 2012年12月31日

深色有隔内生真菌（DSE）重金属抗性机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

PI-IBS中TMEM16A介导IL-4对Cajal细胞损伤的机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

On-demand Mobility-as-a-Service platform assignment games with guaranteed stable outcomes

Arxiv

0+阅读 · 2023年5月1日

Faster Submodular Maximization for Several Classes of Matroids

Arxiv

0+阅读 · 2023年4月28日

Spectral Sparsification for Communication-Efficient Collaborative Rotation and Translation Estimation

Arxiv

0+阅读 · 2023年4月28日

Local-Global Transformer Enhanced Unfolding Network for Pan-sharpening

Arxiv

0+阅读 · 2023年4月28日

A Best-of-Both-Worlds Algorithm for Constrained MDPs with Long-Term Constraints

Arxiv

0+阅读 · 2023年4月27日

A Distributed Online Optimization Strategy for Cooperative Robotic Surveillance

Arxiv

0+阅读 · 2023年4月27日

Local Policy Improvement for Recommender Systems

Arxiv

0+阅读 · 2023年4月26日

Limited-Trust in Social Network Games

Arxiv

0+阅读 · 2023年4月26日

Decentralized and Communication-Free Multi-Robot Navigation through Distributed Games

Arxiv

40+阅读 · 2021年9月15日

The Confluence of Networks, Games and Learning

Arxiv

94+阅读 · 2021年5月17日

VIP会员

文章信息

相关主题

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《复杂工程系统模型驱动设计决策支持系统：早期设计阶段挑战》最新138页

《日本陆上自卫队2040年作战方式与未来作战研究》最新23页slides

人工智能作为战争武器

《后勤保障》最新23页

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

相关论文

On-demand Mobility-as-a-Service platform assignment games with guaranteed stable outcomes

Arxiv

0+阅读 · 2023年5月1日

Faster Submodular Maximization for Several Classes of Matroids

Arxiv

0+阅读 · 2023年4月28日

Spectral Sparsification for Communication-Efficient Collaborative Rotation and Translation Estimation

Arxiv

0+阅读 · 2023年4月28日

Local-Global Transformer Enhanced Unfolding Network for Pan-sharpening

Arxiv

0+阅读 · 2023年4月28日

A Best-of-Both-Worlds Algorithm for Constrained MDPs with Long-Term Constraints

Arxiv

0+阅读 · 2023年4月27日

A Distributed Online Optimization Strategy for Cooperative Robotic Surveillance

Arxiv

0+阅读 · 2023年4月27日

Local Policy Improvement for Recommender Systems

Arxiv

0+阅读 · 2023年4月26日

Limited-Trust in Social Network Games

Arxiv

0+阅读 · 2023年4月26日

Decentralized and Communication-Free Multi-Robot Navigation through Distributed Games

Arxiv

40+阅读 · 2021年9月15日

The Confluence of Networks, Games and Learning

Arxiv

94+阅读 · 2021年5月17日

相关基金

神经系统seipin缺失诱发精神迟滞的分子机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

长链非编码RNA CAR intergenic 10在细胞衰老中的作用和机制

国家自然科学基金

1+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

LCA视角下宏观建筑碳排放结构特性、演化机理及区域化减排机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Haccpper环境中不锈钢表面活性与电化学噪声特征研究

国家自然科学基金

0+阅读 · 2012年12月31日

交通污染影响DNA甲基化及组蛋白乙酰化的表观遗传机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

Skp2-p27信号通路在卵巢早衰发病中的作用机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于景观格局演变的鄱阳湖典型流域水环境响应及其优化模型研究

国家自然科学基金

0+阅读 · 2012年12月31日

深色有隔内生真菌（DSE）重金属抗性机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

PI-IBS中TMEM16A介导IL-4对Cajal细胞损伤的机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员