Manoeuvring in the presence of emergency vehicles is still a major issue for vehicle autonomy systems. Most studies that address this topic are based on rule-based methods, which cannot cover all possible scenarios that can take place in autonomous traffic. Multi-Agent Proximal Policy Optimisation (MAPPO) has recently emerged as a powerful method for autonomous systems because it allows for training in thousands of different situations. In this study, we present an approach based on MAPPO to guarantee the safe and efficient manoeuvring of autonomous vehicles in the presence of an emergency vehicle. We introduce a risk metric that summarises the potential risk of collision in a single index. The proposed method generates cooperative policies allowing the emergency vehicle to go at $15 \%$ higher average speed while maintaining high safety distances. Moreover, we explore the trade-off between safety and traffic efficiency and assess the performance in a competitive scenario.
翻译:紧急车辆出现时的机动仍然是车辆自治系统的一个主要问题。处理这一专题的大多数研究都以基于规则的方法为基础,无法涵盖在自主交通中可能发生的所有可能情况。多正向正向政策优化(MAPPO)最近成为自主系统的一个有力方法,因为它允许在数千种不同情况下进行培训。在本研究中,我们提出了一个基于MAPPO的办法,以保障在紧急车辆出现时安全、高效地操纵自主车辆。我们引入了一种风险指标,用单一指数来概括碰撞的潜在风险。拟议方法产生了合作政策,允许紧急车辆在保持高安全距离的同时以15美元的平均速度行驶。此外,我们探索安全与交通效率之间的权衡,并在竞争的情况下评估性能。