Robots moving safely and in a socially compliant manner in dynamic human environments is an essential benchmark for long-term robot autonomy. However, it is not feasible to learn and benchmark social navigation behaviors entirely in the real world, as learning is data-intensive, and it is challenging to make safety guarantees during training. Therefore, simulation-based benchmarks that provide abstractions for social navigation are required. A framework for these benchmarks would need to support a wide variety of learning approaches, be extensible to the broad range of social navigation scenarios, and abstract away the perception problem to focus on social navigation explicitly. While there have been many proposed solutions, including high fidelity 3D simulators and grid world approximations, no existing solution satisfies all of the aforementioned properties for learning and evaluating social navigation behaviors. In this work, we propose SOCIALGYM, a lightweight 2D simulation environment for robot social navigation designed with extensibility in mind, and a benchmark scenario built on SOCIALGYM. Further, we present benchmark results that compare and contrast human-engineered and model-based learning approaches to a suite of off-the-shelf Learning from Demonstration (LfD) and Reinforcement Learning (RL) approaches applied to social robot navigation. These results demonstrate the data efficiency, task performance, social compliance, and environment transfer capabilities for each of the policies evaluated to provide a solid grounding for future social navigation research.
翻译:在充满活力的人类环境中以安全和符合社会标准的方式安全移动的机器人是长期机器人自主的基本基准。然而,在现实世界中完全学习和基准社会导航行为是不可行的,因为学习是数据密集型的,因此在培训期间提供安全保障具有挑战性。因此,需要为社会导航提供抽象的模拟基准。这些基准的框架需要支持多种多样的学习方法,可以延伸到广泛的社会导航情景,并抽象地消除认识问题,明确侧重于社会导航。虽然有许多拟议解决方案,包括高度忠诚的3D模拟器和电网世界近似,但现有解决方案无法满足上述学习和评价社会导航行为的所有特性。在这项工作中,我们建议SUCTGYM,一个为社会导航提供轻量的2D模拟环境模拟环境环境,在SWEGYM的基础上建立基准情景。此外,我们提供了基准结果,用以比较和比较和比较人类工程和模型的学习方法,以便从演示(LfD)到电网世界近距离学习,没有满足所有上述特性,学习社会导航行为。我们建议SWIGYM(为了展示社会导航的每个运行效率,提供社会学习成绩评估,这些社会学习能力,以及加强社会学习能力,以展示社会导航的进度数据,这些技术,这些应用的学习能力,以展示,这些社会导航的学习能力,以展示,这些社会导航能力,这些技术的进度的进度的进度的进度,提供这些技术,以展示,以证明的进度的进度的进度)提供这些基准环境的进度。