In this work we present a hierarchical framework for solving discrete stochastic pursuit-evasion games (PEGs) in large grid worlds. With a partition of the grid world into superstates (e.g., "rooms"), the proposed approach creates a two-resolution decision-making process, which consists of a set of local PEGs at the original state level and an aggregated PEG at the superstate level. Having much smaller cardinality, both the local games and the aggregated game can be easily solved to a Nash equilibrium. To connect the decision-making at the two resolutions, we use the Nash values of the local PEGs as the rewards for the aggregated game. Through numerical simulations, we show that the proposed hierarchical framework significantly reduces the computation overhead, while still maintaining a satisfactory level of performance when competing against the flat Nash policies.
翻译:在这项工作中,我们提出了一个在大网格世界中解决离散的随机逃生游戏(PEGs)的分级框架。在将网格世界分割成超级国家(例如“室”)的情况下,拟议办法产生了一个双分制的决策过程,其中包括最初州一级的一套当地PEGs和在超级国家一级的综合PEG。由于离散的基点小得多,本地的游戏和合并的游戏都可以很容易地解决成纳什平衡。为了将两个决议的决策联系起来,我们使用当地PEGs的纳什值作为综合游戏的奖励。我们通过数字模拟显示,拟议的等级框架大大降低了计算间接费用,同时在与平板的纳什政策竞争时仍然保持令人满意的业绩水平。