PettingZoo: Gym for Multi-Agent Reinforcement Learning

J. K. Terry, Benjamin Black, Nathaniel Grammel, Mario Jayakumar, Ananth Hari, Ryan Sullivan, Luis Santos, Rodrigo Perez, Caroline Horsch, Clemens Dieffendahl, Niall L. Williams, Yashas Lokesh, Praveen Ravi

This paper introduces the PettingZoo library and the accompanying Agent Environment Cycle ("AEC") games model. PettingZoo is a library of diverse sets of multi-agent environments with a universal, elegant Python API. PettingZoo was developed with the goal of accelerating research in Multi-Agent Reinforcement Learning ("MARL"), by making work more interchangeable, accessible and reproducible akin to what OpenAI's Gym library did for single-agent reinforcement learning. PettingZoo's API, while inheriting many features of Gym, is unique amongst MARL APIs in that it's based around the novel AEC games model. We argue, in part through case studies on major problems in popular MARL environments, that the popular game models are poor conceptual models of the games commonly used with MARL, that they promote severe bugs that are hard to detect, and that the AEC games model addresses these problems.
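
To make the agent-at-a-time idea behind the AEC games model more concrete, here is a minimal sketch in plain Python. It does not use the PettingZoo library or reproduce its API: the class `ToyTurnEnv`, the helper `run_episode`, the policy `always_one`, and the counting game itself are hypothetical names invented for illustration. The only point is that a single agent observes and acts per step, after which the environment updates and hands control to the next agent.

```python
from itertools import cycle


class ToyTurnEnv:
    """Hypothetical two-player counting game, stepped one agent at a time.

    Players alternately add 1 or 2 to a running total. Whoever brings the
    total to `target` or above wins (+1); the other player loses (-1).
    This is an illustration only, not PettingZoo's implementation.
    """

    def __init__(self, target=5):
        self.agents = ["player_0", "player_1"]
        self.target = target
        self.reset()

    def reset(self):
        self.total = 0
        self.done = False
        self.rewards = {agent: 0 for agent in self.agents}
        self._turns = cycle(self.agents)
        self.current_agent = next(self._turns)

    def observe(self):
        # Observation for the agent whose turn it is: the running total.
        return self.total

    def step(self, action):
        # Only the current agent acts; the environment then updates.
        if self.done:
            raise RuntimeError("episode has ended; call reset()")
        if action not in (1, 2):
            raise ValueError("action must be 1 or 2")
        self.total += action
        if self.total >= self.target:
            self.done = True
            for agent in self.agents:
                self.rewards[agent] = 1 if agent == self.current_agent else -1
        else:
            # Hand control to the next agent in the cycle.
            self.current_agent = next(self._turns)


def run_episode(env, policy):
    """Run one episode, querying `policy(agent, observation)` each turn."""
    env.reset()
    history = []
    while not env.done:
        agent = env.current_agent
        action = policy(agent, env.observe())
        history.append((agent, action))
        env.step(action)
    return history, dict(env.rewards)


def always_one(agent, observation):
    """Trivial hypothetical policy: every agent always adds 1."""
    return 1
```

Calling `run_episode(ToyTurnEnv(target=5), always_one)` alternates between the two players until the total reaches the target. Because each call to `step` belongs to exactly one agent, it is always unambiguous whose action produced a given state change and which agent a reward belongs to.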

