A Novel Point-based Algorithm for Multi-agent Control Using the Common Information Approach - 专知 Papers

Agents · Stochastic control · Multi-agent · Control problems · Algorithms

April 10, 2023

A Novel Point-based Algorithm for Multi-agent Control Using the Common Information Approach

Dengwang Tang, Ashutosh Nayyar, Rahul Jain

From arXiv, 11 pages, 4 figures

The Common Information (CI) approach provides a systematic way to transform a multi-agent stochastic control problem to a single-agent partially observed Markov decision problem (POMDP) called the coordinator's POMDP. However, such a POMDP can be hard to solve due to its extraordinarily large action space. We propose a new algorithm for multi-agent stochastic control problems, called coordinator's heuristic search value iteration (CHSVI), that combines the CI approach and point-based POMDP algorithms for large action spaces. We demonstrate the algorithm through optimally solving several benchmark problems.
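To give a feel for why the coordinator's action space grows so quickly, here is a minimal sketch. It assumes the usual setup from the common information literature, where each coordinator action is a joint prescription: one mapping per agent from that agent's private information to an action. The function names and problem sizes below are hypothetical and chosen only for illustration; this is not the CHSVI algorithm itself.

```python
from itertools import product
from math import prod


def count_prescriptions(num_private_states, num_actions):
    """Count joint prescriptions: the product over agents of |A_i| ** |P_i|.

    num_private_states[i] is the number of private-information values of
    agent i, and num_actions[i] is the number of actions of agent i.
    Both are illustrative inputs, not values taken from the paper.
    """
    return prod(a ** p for p, a in zip(num_private_states, num_actions))


def enumerate_prescriptions(num_private_states, num_actions):
    """Iterate over every joint prescription.

    Each joint prescription is a tuple with one entry per agent. Agent i's
    entry is a tuple g where g[p] is the action index selected when the
    agent's private information is p.
    """
    per_agent = [
        list(product(range(a), repeat=p))
        for p, a in zip(num_private_states, num_actions)
    ]
    return product(*per_agent)


if __name__ == "__main__":
    # Two agents, each with 2 private values and 2 actions.
    print(count_prescriptions([2, 2], [2, 2]))
    # Growing the private information to 4 values and the actions to 3.
    print(count_prescriptions([4, 4], [3, 3]))
```

Under these assumptions the count is exponential in the size of each agent's private information, so even small problems produce thousands of coordinator actions. Exhaustive enumeration of this kind quickly becomes impractical, which is the difficulty the abstract refers to.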
