Text-based games have emerged as an important test-bed for Reinforcement Learning (RL) research, requiring RL agents to combine grounded language understanding with sequential decision making. In this paper, we examine the problem of infusing RL agents with commonsense knowledge. Such knowledge would allow agents to efficiently act in the world by pruning out implausible actions, and to perform look-ahead planning to determine how current actions might affect future world states. We design a new text-based gaming environment called TextWorld Commonsense (TWC) for training and evaluating RL agents with a specific kind of commonsense knowledge about objects, their attributes, and affordances. We also introduce several baseline RL agents which track the sequential context and dynamically retrieve the relevant commonsense knowledge from ConceptNet. We show that agents which incorporate commonsense knowledge in TWC perform better, while acting more efficiently. We conduct user-studies to estimate human performance on TWC and show that there is ample room for future improvement.
翻译:以文字为基础的游戏已成为强化学习研究的一个重要测试台,要求学习者将基于语言的理解与顺序决策相结合。在本文中,我们研究了以常识知识传播RL剂的问题。这种知识将使行为者能够通过排除不可信的行动,高效率地在世界上行动,并进行目光领先的规划,以确定当前行动如何影响未来世界国家。我们设计了一个新的以文字为基础的游戏环境,名为TextWorld Commonsense(TWC),用于培训和评价对物体、其属性和负担能力具有特定常识知识的RL剂。我们还引入了几个基线RL剂,跟踪连续环境,动态地从概念网络检索相关的常识。我们表明,将常识知识纳入TWC的代理人表现更好,同时更有效地行动。我们进行用户-研究,以估计人类在TEC的表现,并显示今后有充足的改进空间。