Prior research has extensively explored Autonomous Vehicle (AV) navigation in the presence of other vehicles, however, navigation among pedestrians, who are the most vulnerable element in urban environments, has been less examined. This paper explores AV navigation in crowded, unsignalized intersections. We compare the performance of different deep reinforcement learning methods trained on our reward function and state representation. The performance of these methods and a standard rule-based approach were evaluated in two ways, first at the unsignalized intersection on which the methods were trained, and secondly at an unknown unsignalized intersection with a different topology. For both scenarios, the rule-based method achieves less than 40\% collision-free episodes, whereas our methods result in a performance of approximately 100\%. Of the three methods used, DDQN/PER outperforms the other two methods while it also shows the smallest average intersection crossing time, the greatest average speed, and the greatest distance from the closest pedestrian.
翻译:先前的研究广泛探索了自动机动车辆在其他车辆面前的导航,然而,在城市环境中最易受伤害的行人之间航行的情况却较少。本文探讨了在拥挤、无信号交界处的AV导航情况。我们比较了就我们的奖励功能和国家代表性所培训的不同深度强化学习方法的绩效。这些方法和标准规则方法的绩效以两种方式进行了评价,一种是在方法培训的未信号交界处,另一种是在与不同地形的未知的未信号交接处。两种情况中,基于规则的方法都取得了不到40 ⁇ 的无碰撞事件,而我们的方法则产生了大约100 ⁇ 的性能。在使用的三个方法中,DDQN/PER比其他两种方法都好,同时也显示了最小的平均交叉时间、最高的平均速度和距离最近的行人。