Task-Based Information Compression for Multi-Agent Communication Problems with Channel Rate Constraints - Zhuanzhi (专知) Papers

August 16, 2021

Task-Based Information Compression for Multi-Agent Communication Problems with Channel Rate Constraints

Arsham Mostaani, Thang X. Vu, Symeon Chatzinotas, Björn Ottersten

From arXiv, 15 pages, 8 figures

A collaborative task is assigned to a multi-agent system (MAS) in which agents are allowed to communicate. The MAS runs over an underlying Markov decision process, and its task is to maximize the average sum of discounted one-stage rewards. Although optimal action selection requires knowledge of the global state of the environment, each agent is limited to its own individual observations. Inter-agent communication can mitigate this local observability; however, its limited rate prevents the agents from acquiring precise global state information. To overcome this challenge, agents need to communicate an abstract version of their observations to each other such that the MAS gives up as little of the sum of rewards as possible. We show that this problem is equivalent to a form of rate-distortion problem, which we call task-based information compression (TBIC). We introduce state aggregation for information compression (SAIC) to solve the TBIC problem. SAIC is shown to achieve near-optimal performance in terms of the achieved sum of discounted rewards. The proposed algorithm is applied to a rendezvous problem, and its performance is compared with several benchmarks. Numerical experiments confirm the superiority of the proposed algorithm.
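The state-aggregation idea in the abstract can be illustrated with a small sketch. This is not the authors' SAIC algorithm, whose details the abstract does not give. It assumes that each local observation already has a scalar value estimate (hypothetical numbers here) and groups observations into at most 2^R clusters with a plain one-dimensional k-means, so that the cluster index an agent transmits fits in R bits. The function name `aggregate_observations`, the parameter `rate_bits`, and the clustering rule are all assumptions made for illustration.

```python
def aggregate_observations(values, rate_bits, max_iters=100):
    """Group observations by value so that a group index fits in rate_bits bits.

    values    : hypothetical scalar value estimate for each local observation
    rate_bits : assumed channel budget R, in bits per message
    Returns (labels, centroids); labels[i] is the message sent for observation i.
    """
    if rate_bits < 0:
        raise ValueError("rate_bits must be non-negative")

    distinct = sorted(set(values))
    n = len(distinct)
    # At most 2**R messages exist, and never more groups than distinct values.
    k = min(2 ** rate_bits, n)

    if k == 1:
        centroids = [sum(values) / len(values)]
    else:
        # Spread the initial centroids evenly over the sorted distinct values.
        centroids = [distinct[(i * (n - 1)) // (k - 1)] for i in range(k)]

    labels = [0] * len(values)
    for _ in range(max_iters):
        # Assignment step: each observation joins the nearest centroid.
        labels = [min(range(k), key=lambda c: abs(v - centroids[c])) for v in values]
        # Update step: each centroid moves to the mean of its members.
        updated = []
        for c in range(k):
            members = [v for v, label in zip(values, labels) if label == c]
            updated.append(sum(members) / len(members) if members else centroids[c])
        if updated == centroids:
            break
        centroids = updated
    return labels, centroids


# Six observations with made-up value estimates and a 1-bit channel.
values = [0.0, 0.2, 0.1, 9.0, 9.3, 9.1]
labels, centroids = aggregate_observations(values, rate_bits=1)
print(labels)  # [0, 0, 0, 1, 1, 1]
```

Clustering on value rather than on the raw observation is what makes the compression task-based in this sketch: observations that lead to similar returns share a message, so the bits are spent on the distinctions that affect reward.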
