The ability to estimate human intentions and interact with human drivers intelligently is crucial for autonomous vehicles to successfully achieve their objectives. In this paper, we propose a game theoretic planning algorithm that models human opponents with an iterative reasoning framework and estimates human latent cognitive states through probabilistic inference and active learning. By modeling the interaction as a partially observable Markov decision process with adaptive state and action spaces, our algorithm is able to accomplish real-time lane changing tasks in a realistic driving simulator. We compare our algorithm's lane changing performance in dense traffic with a state-of-the-art autonomous lane changing algorithm to show the advantage of iterative reasoning and active learning in terms of avoiding overly conservative behaviors and achieving the driving objective successfully.
翻译:明智地估计人类意图和与人驾驶员互动的能力对于自主飞行器成功实现其目标至关重要。 在本文中,我们提出一个游戏理论规划算法,用迭代推理框架模拟人类反对者,并通过概率推论和积极学习来估计人类潜伏认知状态。 通过将这种互动模拟作为部分可见的Markov决策程序与适应性状态和行动空间进行部分观察,我们的算法能够在一个现实的驾驶模拟器中完成实时道变化任务。我们比较了我们的算法在密集交通中改变性能的轨迹与最先进的自主航道变化算法,以显示迭代推理和积极学习在避免过度保守行为和成功实现驱动目标方面的优势。