In order to practically implement the door opening task, a policy ought to be robust to a wide distribution of door types and environment settings. Reinforcement Learning (RL) with Domain Randomization (DR) is a promising technique to enforce policy generalization, however, there are only a few accessible training environments that are inherently designed to train agents in domain randomized environments. We introduce DoorGym, an open-source door opening simulation framework designed to utilize domain randomization to train a stable policy. We intend for our environment to lie at the intersection of domain transfer, practical tasks, and realism. We also provide baseline Proximal Policy Optimization and Soft Actor-Critic implementations, which achieves success rates between 0% up to 95% for opening various types of doors in this environment. Moreover, the real-world transfer experiment shows the trained policy is able to work in the real world. Environment kit available here: https://github.com/PSVL/DoorGym/
翻译:为了切实落实开关任务,一项政策应当对广泛分布的门型和环境环境环境进行强有力的分配。与域随机化(DR)一起的强化学习(RL)是执行政策一般化的有希望的方法,然而,只有为数不多的无障碍培训环境,其内在设计是为了在域随机化环境中培训代理。我们引入了开放源门开关模拟框架DoorGym, 目的是利用域随机化来训练稳定的政策。我们打算让环境处于域转移、实际任务和现实主义的交叉点。我们还提供基线的优化政策和软动作-Critical执行,在这种环境中打开各种门的成功率在0%至95%之间。此外,现实世界转移实验显示,经过培训的政策能够在现实世界中发挥作用。这里提供的环境工具包有:https://github.com/PSVL/DoorGym/。