We study stochastic zero-sum games on graphs, which are prevalent tools to model decision-making in presence of an antagonistic opponent in a random environment. In this setting, an important question is the one of strategy complexity: what kinds of strategies are sufficient or required to play optimally (e.g., randomization or memory requirements)? Our contributions further the understanding of arena-independent finite-memory (AIFM) determinacy, i.e., the study of objectives for which memory is needed, but in a way that only depends on limited parameters of the game graphs. First, we show that objectives for which pure AIFM strategies suffice to play optimally also admit pure AIFM subgame perfect strategies. Second, we show that we can reduce the study of objectives for which pure AIFM strategies suffice in two-player stochastic games to the easier study of one-player stochastic games (i.e., Markov decision processes). Third, we characterize the sufficiency of AIFM strategies through two intuitive properties of objectives. This work extends a line of research started on deterministic games in [BLO+20] to stochastic ones. [BLO+20] Patricia Bouyer, St\'ephane Le Roux, Youssouf Oualhadj, Mickael Randour, and Pierre Vandenhove. Games Where You Can Play Optimally with Arena-Independent Finite Memory. CONCUR 2020.
翻译:我们研究图表上的零和游戏,这是在随机环境中当着对立对手进行模拟决策的常用工具。在这个背景下,一个重要的问题是战略复杂性:什么样的战略足够或需要最优化地发挥(如随机化或记忆要求)?我们的贡献进一步增进了对视场独立有限游戏(AIFM)确定性的理解,即对需要记忆的目标的研究,但只能依赖游戏图表的有限参数。首先,我们展示了纯的AIFM战略足以最理想地接受纯的AIFM亚游戏完美战略的目标:哪些战略足以或需要最优化地发挥(如随机化或记忆要求)?我们的贡献进一步增进了对单玩游戏(即Markov决定程序)的轻松研究。我们通过目标的两个直观性属性来描述AIFM战略的充足性。本项工作把关于确定性游戏的系列研究范围从PARIC+OVA、OFOA、ROFO和ROFOFO。