Meta-CPR: Generalize to Unseen Large Number of Agents with Communication Pattern Recognition Module

Designing an effective communication mechanism among agents in reinforcement learning has been a challenging task, especially for real-world applications. In real-world scenarios, the number of agents can grow, or an environment may need to interact with a changing number of agents. A multi-agent framework therefore needs to handle varying agent configurations, in terms of both scale and dynamics, to be practical for real-world applications. We formulate the multi-agent environment with a different number of agents as a multi-task problem and propose a meta reinforcement learning (meta-RL) framework to tackle it. The proposed framework employs a meta-learned Communication Pattern Recognition (CPR) module to identify communication behavior and extract information that facilitates the training process. Experimental results demonstrate that the proposed framework (a) generalizes to an unseen, larger number of agents and (b) allows the number of agents to change between episodes. An ablation study is also provided to justify the proposed CPR design and show that it is effective.
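The multi-task framing in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it only assumes that each agent count is treated as one task, and it uses mean pooling over per-agent messages as a stand-in for a component whose output size does not depend on the number of agents. The names `sample_task`, `pool_messages`, and `run_episode` are hypothetical, and the random messages are placeholders for learned communication.

```python
import random
from statistics import fmean


def sample_task(agent_counts, rng):
    """Treat each agent count as one task and draw a task for the next episode."""
    return rng.choice(agent_counts)


def pool_messages(messages):
    """Mean-pool per-agent message vectors into one fixed-size summary.

    Mean pooling is an illustrative assumption, not the paper's CPR module.
    """
    dim = len(messages[0])
    return [fmean(m[i] for m in messages) for i in range(dim)]


def run_episode(n_agents, msg_dim, rng):
    """Hypothetical episode: every agent emits a random message vector."""
    messages = [[rng.random() for _ in range(msg_dim)] for _ in range(n_agents)]
    return pool_messages(messages)


rng = random.Random(0)
train_tasks = [2, 3, 4]  # agent counts seen during training (assumed values)

# The number of agents may change from one episode to the next.
for episode in range(3):
    n_agents = sample_task(train_tasks, rng)
    summary = run_episode(n_agents, msg_dim=4, rng=rng)
    print(episode, n_agents, len(summary))

# An unseen, larger agent count still yields a summary of the same size.
unseen_summary = run_episode(10, msg_dim=4, rng=rng)
print(len(unseen_summary))
```

Because the pooled summary has the same length for any agent count, a downstream policy in this sketch would not need a different input size when the number of agents changes. Whether the CPR module achieves this in a similar way is not stated in the abstract.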


