Recent advances in reinforcement learning with social agents have allowed such models to achieve human-level performance on specific interaction tasks. However, most interactive scenarios do not have a version alone as an end goal; instead, the social impact of these agents when interacting with humans is as important and largely unexplored. In this regard, this work proposes a novel reinforcement learning mechanism based on the social impact of rivalry behavior. Our proposed model aggregates objective and social perception mechanisms to derive a rivalry score that is used to modulate the learning of artificial agents. To investigate our proposed model, we design an interactive game scenario, using the Chef's Hat Card Game, and examine how the rivalry modulation changes the agent's playing style, and how this impacts the experience of human players in the game. Our results show that humans can detect specific social characteristics when playing against rival agents when compared to common agents, which directly affects the performance of the human players in subsequent games. We conclude our work by discussing how the different social and objective features that compose the artificial rivalry score contribute to our results.
翻译:社会行为主体的强化学习最近的进展使得这些模式得以在具体互动任务上实现人文层面的绩效。然而,大多数互动情景并不单以一个版本作为最终目标;相反,这些参与者在与人类互动时的社会影响同样重要,而且基本上没有探索。在这方面,这项工作提出了基于对抗行为的社会影响的新颖的强化学习机制。我们提议的模型汇总了客观和社会认知机制,以得出用于调节人工行为主体学习的对立分。为了调查我们提议的模型,我们设计了一个互动游戏情景,使用主厨的帽子牌游戏,并研究竞争调节如何改变代理人的玩耍风格,以及这如何影响游戏中人类玩家的经验。我们的结果显示,人类在与普通行为者相比,与竞争者竞争时可以发现特定的社会特征,这直接影响到后来游戏中人类玩家的表现。我们结束我们的工作,通过讨论构成人为对立分的不同社会和客观特征如何促进我们的结果。