We explore some strategies which tend to perform well in the IPD. We start off by showing the significance of Tit-For-Tat strategies in evolutionary game theory. This is followed by a theoretical derivation of zero-determinant strategies, where we highlight an error on bounds for scale parameters from the original paper on ZD strategies[6]. We then present examples of such strategies and create a custom player drawing inspiration from Markov Decision Processes. At the end we pit them all against each other and see how they perform in an IPD tournament.
翻译:我们探索一些在IPD中表现良好的策略。 我们首先展示Tit- For-Tat策略在进化游戏理论中的重要性。 之后是零决定性策略的理论衍生, 我们从零决定性策略的原始论文中强调了比例参数界限上的错误[6]。 我们然后展示这些策略的例子, 并创建一个从Markov 决策程序中得到启发的自定义玩家。 最后, 我们把它们都放在对立的位置上, 看看他们在IPD锦标赛中的表现如何 。