Rating strategies in a game is an important area of research in game theory and artificial intelligence, and can be applied to any real-world competitive or cooperative setting. Traditionally, only transitive dependencies between strategies have been used to rate strategies (e.g. Elo), however recent work has expanded ratings to utilize game theoretic solutions to better rate strategies in non-transitive games. This work generalizes these ideas and proposes novel algorithms suitable for N-player, general-sum rating of strategies in normal-form games according to the payoff rating system. This enables well-established solution concepts, such as equilibria, to be leveraged to efficiently rate strategies in games with complex strategic interactions, which arise in multiagent training and real-world interactions between many agents. We empirically validate our methods on real world normal-form data (Premier League) and multiagent reinforcement learning agent evaluation.
翻译:游戏中的评分战略是游戏理论和人工智能研究的一个重要领域,可以适用于任何现实世界的竞争或合作环境。传统上,只有战略之间的过渡依赖性才被用于评分战略(例如Elo ),但最近的工作扩大了评分,利用游戏理论解决方案来提高非过渡游戏的评分战略的评分率。这项工作概括了这些想法,并提出了适合N型玩家的新算法,根据报偿评分制度对正常游戏中的战略进行一般和一般评分。这使得成熟的解决方案概念,如平衡概念,能够用于在具有复杂战略互动的游戏中有效评分战略,这些互动产生于多剂培训和许多代理之间的现实世界互动。我们用经验验证了我们关于真实世界正常数据(Primier Al联盟)和多剂强化学习代理评价的方法。