Robots deployed in real-world environments should operate safely in a robust manner. In scenarios where an "ego" agent navigates in an environment with multiple other "non-ego" agents, two modes of safety are commonly proposed -- adversarial robustness and probabilistic constraint satisfaction. However, while the former is generally computationally intractable and leads to overconservative solutions, the latter typically relies on strong distributional assumptions and ignores strategic coupling between agents. To avoid these drawbacks, we present a novel formulation of robustness within the framework of general-sum dynamic game theory, modeled on defensive driving. More precisely, we prepend an adversarial phase to the ego agent's cost function. That is, we prepend a time interval during which other agents are assumed to be temporarily distracted, in order to render the ego agent's equilibrium trajectory robust against other agents' potentially dangerous behavior during this time. We demonstrate the effectiveness of our new formulation in encoding safety via multiple traffic scenarios.
翻译:在现实世界环境中部署的机器人应该以稳健的方式安全地运行。在“ego”代理器在有多个其他“n-ego”代理器的环境中航行的情况下,通常会提出两种安全模式 -- -- 对抗性稳健性和概率约束性满足。然而,虽然前者一般是计算上难以解决的,导致过度保守的解决办法,但后者通常依赖强大的分布假设,忽视代理器之间的战略组合。为了避免这些缺陷,我们在以防御性驾驶为模型的一般和动态游戏理论框架内提出了一种新型的强健性配方。更确切地说,我们把对抗性阶段设定在自我代理器的成本功能上。也就是说,我们预先设定了一个时间间隔期,假设其他代理器暂时分心,以便使自我代理器的平衡轨迹在这段时间里能够针对其他代理器的潜在危险行为而强大起来。我们通过多种交通情景展示了我们新的编码安全公式的有效性。