We present a generic strategy iteration algorithm (GSIA) to find an optimal strategy of a simple stochastic game (SSG). We prove the correctness of GSIA, and derive a general complexity bound, which implies and improves on the results of several articles. First, we remove the assumption that the SSG is stopping, which is usually obtained by a polynomial blowup of the game. Second, we prove a tight bound on the denominator of the values associated to a strategy, and use it to prove that all strategy iteration algorithms are in fact fixed parameter tractable in the number of random vertices. All known strategy iteration algorithms can be seen as instances of GSIA, which allows to analyze the complexity of converge from below by Condon and to propose a class of algorithms generalising Gimbert and Horn's algorithm. These algorithms require less than $r!$ iterations in general and less iterations than the current best deterministic algorithm for binary SSGs given by Ibsen-Jensen and Miltersen.
翻译:我们提出了一个通用战略传动算法(GSIA),以寻找简单随机游戏的最佳策略。我们证明了GSIA的正确性,并得出了一个总体复杂性约束,这意味着并改进了多篇文章的结果。首先,我们取消了SSG停止的假设,通常通过对游戏的多面性打击获得。第二,我们证明对战略相关值的分母有着紧密的界限,并用它来证明所有战略传动算法事实上都是随机脊椎数中固定的参数。所有已知的战略传动算法都可以被视为GSIA的例子,从而可以分析Condon从下面汇集的复杂程度,并提议一种概括Gimbert和Horn的算法的算法类别。这些算法一般需要比Ibsen-Jensen和Milteren提供的二进制SSG的当前最佳确定性算法要少得多。