For general-sum, n-player, strategic games with transferable utility, the Harsanyi-Shapley value provides a computable method to both 1) quantify the strategic value of a player; and 2) make cooperation rational through side payments. We give a simple formula to compute the HS value in normal-form games. Next, we provide two methods to generalize the HS values to stochastic (or Markov) games, and show that one of them may be computed using generalized Q-learning algorithms. Finally, an empirical validation is performed on stochastic grid-games with three or more players. Source code is provided to compute HS values for both the normal-form and stochastic game setting.
翻译:对于一般和、正玩家、具有可转让用途的战略游戏,Harsanyi-Shapley值提供了一种可计算的方法,以便(1) 量化玩家的战略价值;(2) 通过侧面付款使合作合理。我们给出了一个简单的公式来计算普通游戏中的 HS 值。接下来,我们提供了两种方法来将 HS 值推广到随机游戏(或Markov),并显示其中一种值可以使用通用的Q- 学习算法计算。最后,在与三个或三个以上玩家的随机电网格游戏中进行了经验验证。提供了源代码来计算正常和随机游戏设置的 HS 值。</s>