We present Megaverse, a new 3D simulation platform for reinforcement learning and embodied AI research. The efficient design of our engine enables physics-based simulation with high-dimensional egocentric observations at more than 1,000,000 actions per second on a single 8-GPU node. Megaverse is up to 70x faster than DeepMind Lab in fully-shaded 3D scenes with interactive objects. We achieve this high simulation performance by leveraging batched simulation, thereby taking full advantage of the massive parallelism of modern GPUs. We use Megaverse to build a new benchmark that consists of several single-agent and multi-agent tasks covering a variety of cognitive challenges. We evaluate model-free RL on this benchmark to provide baselines and facilitate future research. The source code is available at https://www.megaverse.info
翻译:我们展示了三维新模拟平台,用于强化学习,并体现了AI研究。我们的引擎高效设计使基于物理的模拟能够以高维自我中心观察每秒在8-GPU节点上进行100万次以上的高维自我中心观测。在全光3D场景中,Megaverse比DeepMind实验室快70倍。我们通过利用分批模拟,充分利用现代GPU的巨大平行性来取得这种高的模拟性能。我们利用Megaverse建立一个新的基准,其中包括若干个单一试剂和多试剂任务,涵盖各种认知挑战。我们评估这一基准的无型RL,以提供基线和便利未来的研究。源代码可在https://www.megaverse.info查阅。