Using artificial intelligence (AI) to automatically test a game remains a critical challenge for the development of richer and more complex game worlds and for the advancement of AI at large. One of the most promising methods for achieving that long-standing goal is the use of generative AI agents, namely procedural personas, that attempt to imitate particular playing behaviors which are represented as rules, rewards, or human demonstrations. All research efforts for building those generative agents, however, have focused solely on playing behavior which is arguably a narrow perspective of what a player actually does in a game. Motivated by this gap in the existing state of the art, in this paper we extend the notion of behavioral procedural personas to cater for player experience, thus examining generative agents that can both behave and experience their game as humans would. For that purpose, we employ the Go-Explore reinforcement learning paradigm for training human-like procedural personas, and we test our method on behavior and experience demonstrations of more than 100 players of a racing game. Our findings suggest that the generated agents exhibit distinctive play styles and experience responses of the human personas they were designed to imitate. Importantly, it also appears that experience, which is tied to playing behavior, can be a highly informative driver for better behavioral exploration.
翻译:利用人工智能(AI)自动测试游戏,对于发展更丰富、更复杂的游戏世界和整个AI的进步来说,仍然是一项关键的挑战。实现这一长期目标的最有希望的方法之一是使用基因化的AI代理,即程序人,试图模仿作为规则、奖励或人类演示代表的特殊游戏行为。然而,建设这些基因化代理的所有研究努力都仅仅侧重于玩游戏,这可以说是一个玩家在游戏中实际所做的事情的狭隘视角。我们发现,受目前艺术状态中这种差距的驱动,我们在本文中扩展行为程序人的概念,以迎合玩家的经验,从而审查既能行为又能体验游戏的基因化代理。为此,我们采用了Go-Explore强化学习模式来培训像人类的程序人,我们测试我们关于100多个玩家的行为和经历演示的方法。我们的研究结果表明,所创造的代理者展现了独特的游戏风格和体验,因为他们设计要模仿游戏。重要的是,它似乎具有一种高度丰富的探索性的行为。它可以把一种高度的探索性的行为捆绑绑绑起来。