We consider strategic settings where several users engage in a repeated online interaction, assisted by regret-minimizing agents that repeatedly play a "game" on their behalf. We study the dynamics and average outcomes of the repeated game of the agents, and view it as inducing a meta-game between the users. Our main focus is on whether users can benefit in this meta-game from "manipulating" their own agent by mis-reporting their parameters to it. We formally define this "user-agent meta-game" model for general games, discuss its properties under different notions of convergence of the dynamics of the automated agents and analyze the equilibria induced on the users in 2x2 games in which the dynamics converge to a single equilibrium.
 翻译:我们考虑一些战略环境,让几个用户在代表他们反复玩“游戏”的“游戏”的遗憾最小化剂的协助下,反复进行在线互动。我们研究了代理器反复游戏的动态和平均结果,并将它视为引发用户之间的元游戏。我们的主要焦点是用户是否可以通过错误地向它报告参数而从“管理”自己的代理器的元游戏中受益。我们正式定义了普通游戏的“用户-代理元游戏”模式,根据自动代理器动态趋同的不同概念讨论其属性,并分析了2x2游戏中向用户带来的平衡,其中动态会合到单一平衡。