Language-based environment manipulation requires agents to manipulate the environment following natural language instructions, which is challenging due to the huge space of the environments. To address this challenge, various approaches have been proposed in recent work. Although these approaches work well for their intended environments, they are difficult to generalize across environments. In this work, we propose LEMON, a general framework for language-based environment manipulation tasks. Specifically, we first specify a task-agnostic approach for language-based environment manipulation tasks, which can deal with various environments using the same generative language model. Then we propose an execution-guided pre-training strategy to inject prior knowledge of environments to the language model with a pure synthetic pre-training corpus. Experimental results on tasks including Alchemy, Scene, Tangrams, ProPara and Recipes demonstrate the effectiveness of LEMON: it achieves new state-of-the-art results on four of the tasks, and the execution-guided pre-training strategy brings remarkable improvements on all experimental tasks.
翻译:以语言为基础的环境操纵要求代理商按照自然语言指令操控环境,由于环境空间巨大,这具有挑战性。为了应对这一挑战,最近的工作提出了各种办法。虽然这些办法在预期环境中运作良好,但很难在各种环境中加以概括。在这项工作中,我们提议LEMON为以语言为基础的环境操纵任务提供一个总框架。具体地说,我们首先为基于语言的环境操纵任务规定一种任务-不可知性办法,这种任务可以使用同样的基因化语言模式处理各种环境。然后,我们提出一项执行指导的训练前战略,将事先的环境知识注入语言模型,并配有纯合成的训练前材料。关于Alchemy、Scene、Tangrams、ProPara和Reipes等任务的实验结果展示了LEMON:它在四项任务中取得了新的最新成果,而执行指导的训练前战略使所有实验任务都取得了显著的改进。