Learning to cooperate with other agents is challenging when those agents also possess the ability to adapt to our own behavior. Practical and theoretical approaches to learning in cooperative settings typically assume that other agents' behaviors are stationary, or else make very specific assumptions about other agents' learning processes. The goal of this work is to understand whether we can reliably learn to cooperate with other agents without such restrictive assumptions, which are unlikely to hold in real-world applications. Our main contribution is a set of impossibility results, which show that no learning algorithm can reliably learn to cooperate with all possible adaptive partners in a repeated matrix game, even if that partner is guaranteed to cooperate with some stationary strategy. Motivated by these results, we then discuss potential alternative assumptions which capture the idea that an adaptive partner will only adapt rationally to our behavior.
翻译:当这些代理商也具备适应我们自身行为的能力时,学习与其他代理商合作是具有挑战性的。在合作环境中学习的实践和理论方法通常假定其他代理商的行为是静止的,或者对其他代理商的学习过程做出非常具体的假设。 这项工作的目标是了解我们是否能够可靠地学习与其他代理商合作,而没有这种限制性假设,这些假设在现实世界应用中不可能维持。 我们的主要贡献是一系列不可能的结果,这表明任何学习算法都无法可靠地学习与所有可能的适应性伙伴在反复的矩阵游戏中合作,即使该伙伴被保证与某种固定的战略合作。 受这些结果的激励,我们然后讨论潜在的替代假设,这些假设抓住了适应性伙伴只能合理适应我们行为的想法。