We analyze an extended model of the Iterated Prisoner's Dilemma where agents decide to play based on the data from their limited memory or recommendations. The cooperators can decide whether to play with the matched opponent or not. The agents' decisions are directly linked to their level of optimism since they decide to play if they believe the opponent has a high probability of cooperating. Optimism is precisely tuned by parameters optimism threshold and tolerance. Our experiment showed that being optimistic is better for cooperators as it leads to more accurate exploration in the multi-agent system, which tolerates the vulnerability against defectors.
翻译:我们分析过热的囚犯困境的扩大模型,其中代理人根据有限的记忆或建议中的数据决定玩耍。 合作者可以决定是否和对手玩耍。 合作者的决定与其乐观程度直接相关,因为他们认为如果对方认为对方合作的概率很高,他们决定玩耍。 乐观主义与参数乐观阈值和容忍度完全一致。 我们的实验表明,乐观对于合作者来说更好,因为它导致在多试剂系统中进行更准确的探索,因为多试剂系统能容忍对叛逃者的脆弱性。