Driving safely requires multiple capabilities from human and intelligent agents, such as the generalizability to unseen environments, the decision making in complex multi-agent settings, and the safety awareness of the surrounding traffic. Despite the great success of reinforcement learning, most of the RL research studies each capability separately due to the lack of the integrated interactive environments. In this work, we develop a new driving simulation platform called MetaDrive for the study of generalizable reinforcement learning algorithms. MetaDrive is highly compositional, which can generate an infinite number of diverse driving scenarios from both the procedural generation and the real traffic data replay. Based on MetaDrive, we construct a variety of RL tasks and baselines in both single-agent and multi-agent settings, including benchmarking generalizability across unseen scenes, safe exploration, and learning multi-agent traffic. We open-source this simulator and maintain its development at: https://github.com/decisionforce/metadrive
翻译:安全驾驶需要来自人类和智能物剂的多种能力,例如通用到看不见的环境、复杂的多剂环境中的决策以及周围交通的安全意识。尽管强化学习取得了巨大成功,但大多数RL研究都因缺乏综合互动环境而分别对每种能力进行了研究。在这项工作中,我们开发了一个名为MetaDrive的新的驱动模拟平台,用于研究可普遍适用的强化学习算法。MetaDrive具有高度的构成性,能够从程序生成和真实的交通数据重播中产生无数不同的驱动场景。基于MetaDrive,我们在单剂和多剂环境中都建立了各种RL任务和基线,包括设定在看不见的场景、安全探索和学习多剂交通的基准。我们开发了这个模拟器,并在以下网址保持其开发:https://github.com/decisforce/metaddrive。