分散式结构- RNNN 用于机器人人群导航和深强化学习 (Decentralized Structural-RNN for Robot Crowd Navigation with Deep Reinforcement Learning)

Safe and efficient navigation through human crowds is an essential capability for mobile robots. Previous work on robot crowd navigation assumes that the dynamics of all agents are known and well-defined. In addition, the performance of previous methods deteriorates in partially observable environments and environments with dense crowds. To tackle these problems, we propose decentralized structural-Recurrent Neural Network (DS-RNN), a novel network that reasons about spatial and temporal relationships for robot decision making in crowd navigation. We train our network with model-free deep reinforcement learning without any expert supervision. We demonstrate that our model outperforms previous methods in challenging crowd navigation scenarios. We successfully transfer the policy learned in the simulator to a real-world TurtleBot 2i.

翻译：通过人群进行安全和高效的导航是移动机器人的基本能力。以前关于机器人人群导航的工作假定所有物剂的动态是已知的和定义明确的。此外,在部分可见的环境下和人群稠密的环境中,以往方法的性能会恶化。为了解决这些问题,我们提议分散结构-实时神经网络(DS-RNNN),这是一个新颖的网络,可以解释在人群导航中机器人决策的空间和时间关系。我们培训我们的网络,在没有任何专家监督的情况下进行无型深层强化学习。我们证明我们的模型在挑战人群导航情景方面比以往的方法要好。我们成功地将模拟器所学的政策转移到现实世界的TurturtBot 2i 。

相关内容

深度强化学习

关注 154

深度强化学习 (DRL) 是一种使用深度学习技术扩展传统强化学习方法的一种机器学习方法。传统强化学习方法的主要任务是使得主体根据从环境中获得的奖赏能够学习到最大化奖赏的行为。然而，传统无模型强化学习方法需要使用函数逼近技术使得主体能够学习出值函数或者策略。在这种情况下，深度学习强大的函数逼近能力自然成为了替代人工指定特征的最好手段并为性能更好的端到端学习的实现提供了可能。

【DeepMind】基于模型的强化学习，174页ppt，Model-Based Reinforcement Learning

专知会员服务

89+阅读 · 2021年1月12日

【MIT】反偏差对比学习，Debiased Contrastive Learning

专知会员服务

91+阅读 · 2020年7月4日

【CVPR2020】视觉导航的神经拓扑SLAM，Neural Topological SLAM for Visual Navigation

专知会员服务

52+阅读 · 2020年5月26日

强化学习的对比无监督表示，CURL: Contrastive Unsupervised Representations for Reinforcement Learning

专知会员服务

41+阅读 · 2020年4月11日