Multi-agent Reinforcement Learning: Asynchronous Communication and Linear Function Approximation

May 10, 2023

Yifei Min, Jiafan He, Tianhao Wang, Quanquan Gu

From arXiv. Published at the 40th International Conference on Machine Learning (ICML 2023).

We study multi-agent reinforcement learning in the setting of episodic Markov decision processes, where multiple agents cooperate via communication through a central server. We propose a provably efficient algorithm based on value iteration that enables asynchronous communication while preserving the advantage of cooperation with low communication overhead. With linear function approximation, we prove that our algorithm enjoys an $\tilde{\mathcal{O}}(d^{3/2}H^2\sqrt{K})$ regret with $\tilde{\mathcal{O}}(dHM^2)$ communication complexity, where $d$ is the feature dimension, $H$ is the horizon length, $M$ is the total number of agents, and $K$ is the total number of episodes. We also provide a lower bound showing that a minimal $\Omega(dM)$ communication complexity is required to improve the performance through collaboration.
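
To get a feel for how the stated bounds scale, the following minimal sketch evaluates only their leading terms. It is not the paper's algorithm: the function names are hypothetical, and the constants and logarithmic factors hidden by the $\tilde{\mathcal{O}}$ and $\Omega$ notation are simply dropped, so the numbers are meaningful only as ratios.

```python
import math


def regret_leading_term(d: int, H: int, K: int) -> float:
    """Leading term d^{3/2} * H^2 * sqrt(K) of the regret bound.

    Constants and log factors are omitted (assumption for illustration).
    """
    return d ** 1.5 * H ** 2 * math.sqrt(K)


def communication_leading_term(d: int, H: int, M: int) -> int:
    """Leading term d * H * M^2 of the communication upper bound."""
    return d * H * M ** 2


def communication_lower_term(d: int, M: int) -> int:
    """Leading term d * M of the communication lower bound."""
    return d * M


if __name__ == "__main__":
    d, H, M = 4, 2, 3  # illustrative values, not taken from the paper
    for K in (100, 400):
        print(K, regret_leading_term(d, H, K))
    print(communication_leading_term(d, H, M), communication_lower_term(d, M))
```

In this sketch, quadrupling $K$ only doubles the regret term, so the average regret per episode shrinks like $1/\sqrt{K}$, while the communication term as stated has no polynomial dependence on $K$.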
