While many multi-robot coordination problems can be solved optimally by exact algorithms, solutions are often not scalable in the number of robots. Multi-Agent Reinforcement Learning (MARL) is gaining increasing attention in the robotics community as a promising solution to tackle such problems. Nevertheless, we still lack the tools that allow us to quickly and efficiently find solutions to large-scale collective learning tasks. In this work, we introduce the Vectorized Multi-Agent Simulator (VMAS). VMAS is an open-source framework designed for efficient MARL benchmarking. It is comprised of a vectorized 2D physics engine written in PyTorch and a set of twelve challenging multi-robot scenarios. Additional scenarios can be implemented through a simple and modular interface. We demonstrate how vectorization enables parallel simulation on accelerated hardware without added complexity. When comparing VMAS to OpenAI MPE, we show how MPE's execution time increases linearly in the number of simulations while VMAS is able to execute 30,000 parallel simulations in under 10s, proving more than 100x faster. Using VMAS's RLlib interface, we benchmark our multi-robot scenarios using various Proximal Policy Optimization (PPO)-based MARL algorithms. VMAS's scenarios prove challenging in orthogonal ways for state-of-the-art MARL algorithms. The VMAS framework is available at https://github.com/proroklab/VectorizedMultiAgentSimulator. A video of VMAS scenarios and experiments is available at https://youtu.be/aaDRYfiesAY.
翻译:虽然许多多机器人协调问题可以通过精确的算法以最佳方式解决,但解决方案往往无法在机器人数量上加以伸缩。多代理人强化学习(MARL)正在机器人界日益受到越来越多的关注,作为解决这类问题的有希望的解决办法。然而,我们仍缺乏工具,使我们无法迅速和有效地找到大规模集体学习任务的解决方案。在这项工作中,我们引入了矢量化多显性模拟器(VMAS)。VMAS是一个为有效的MAL基准设定而设计的开放源框架。它由以平托奇书写的矢量化 2D 物理引擎和一组具有挑战性的多机器人设想方案组成。额外设想方案可以通过一个简单和模块界面来实施。我们展示了矢量化能够快速和高效地找到大规模集体学习任务的解决方案。我们展示了MPE的执行时间在模拟数量上线性增加,而VMAS能够在10个以下执行30 000个平行的视频模拟,证明速度超过100个。在PRalibal-D界面上使用VMALMAL-MALA(我们用多MAL)的多MALA方式测试了我们多MALA-A。