Many important real-world settings contain multiple players interacting over an unknown duration with probabilistic state transitions, and are naturally modeled as stochastic games. Prior research on algorithms for stochastic games has focused on two-player zero-sum games, games with perfect information, and games with imperfect-information that is local and does not extend between game states. We present an algorithm for approximating Nash equilibrium in multiplayer general-sum stochastic games with persistent imperfect information that extends throughout game play. We experiment on a 4-player imperfect-information naval strategic planning scenario. Using a new procedure, we are able to demonstrate that our algorithm computes a strategy that closely approximates Nash equilibrium in this game.
翻译:许多重要的现实世界设置包含多个玩家在未知的时间段内进行互动,并带有概率性国家转型,并且自然地被模拟为随机游戏。 先前关于随机游戏的算法研究侧重于两个玩家零和游戏,拥有完美信息的游戏,以及具有不完善信息的游戏,而这种游戏是本地的,且在游戏状态之间并不延伸。我们在多玩家一般和随机游戏中提出了一个接近纳什平衡的算法,这种游戏具有持续不完善的信息,贯穿于游戏的全过程。我们实验了四玩家不完善的信息海军战略规划情景。我们使用新的程序,能够证明我们的算法是接近纳什平衡的策略。