Seamlessly operating an autonomous vehicle in a crowded pedestrian environment is a very challenging task. This is because human movement and interactions are very hard to predict in such environments. Recent work has demonstrated that reinforcement learning-based methods have the ability to learn to drive in crowds. However, these methods can have very poor performance due to inaccurate predictions of the pedestrians' future state as human motion prediction has a large variance. To overcome this problem, we propose a new method, SARL-SGAN-KCE, that combines a deep socially aware attentive value network with a human multimodal trajectory prediction model to help identify the optimal driving policy. We also introduce a novel technique to extend the discrete action space with minimal additional computational requirements. The kinematic constraints of the vehicle are also considered to ensure smooth and safe trajectories. We evaluate our method against the state of art methods for crowd navigation and provide an ablation study to show that our method is safer and closer to human behaviour.
翻译:在拥挤的行人环境中无缝操作自主车辆是一项极具挑战性的任务。这是因为人类流动和互动在这种环境中很难预测。最近的工作表明,强化学习方法有能力在人群中驾车。然而,由于对行人未来状态的预测不准确,这些方法的性能可能非常差,因为人类运动预测存在巨大差异。为了克服这一问题,我们提出了一种新的方法,即SARSL-SGAN-KCE,将一个深为社会意识的注意价值网络与一个人类多式联运轨迹预测模型结合起来,以帮助确定最佳驾驶政策。我们还引进了一种新技术,以尽量少增加计算要求的方式扩大离散行动空间。车辆的动态限制也被视为确保滑动和安全的轨迹。我们对照人群导航的艺术方法评估了我们的方法,并提供了一种模拟研究,以表明我们的方法更安全、更接近人类行为。