While there has been an increasing focus on the use of game theoretic models for autonomous driving, empirical evidence shows that there are still open questions around dealing with the challenges of common knowledge assumptions as well as modeling bounded rationality. To address some of these practical challenges, we develop a framework of generalized dynamic cognitive hierarchy for both modelling naturalistic human driving behavior as well as behavior planning for autonomous vehicles (AV). This framework is built upon a rich model of level-0 behavior through the use of automata strategies, an interpretable notion of bounded rationality through safety and maneuver satisficing, and a robust response for planning. Based on evaluation on two large naturalistic datasets as well as simulation of critical traffic scenarios, we show that i) automata strategies are well suited for level-0 behavior in a dynamic level-k framework, and ii) the proposed robust response to a heterogeneous population of strategic and non-strategic reasoners can be an effective approach for game theoretic planning in AV.
翻译:虽然人们越来越重视使用游戏理论模型进行自主驾驶,但经验证据表明,在应对共同知识假设和模拟界限合理性的挑战方面,仍有一些尚未解决的问题。为了应对其中一些实际挑战,我们为模拟自然人类驾驶行为和自主车辆行为规划(AV)开发了一个通用的动态认知等级框架。这个框架基于一个丰富的0级行为模型,即通过使用自动数据战略,一个可解释的通过安全和操纵讽刺实现约束性合理性的概念,以及对规划的有力反应。根据对两个大型自然数据集的评价以及关键交通情景的模拟,我们表明,一) 自动数据战略非常适合动态水平框架中的0级行为,二) 拟议的对战略和非战略理性者混杂人群的有力反应可以成为AV游戏理论规划的有效方法。