Extracting the common sense knowledge present in Large Language Models (LLMs) offers a path to designing intelligent, embodied agents. Related works have queried LLMs with a wide-range of contextual information, such as goals, sensor observations and scene descriptions, to generate high-level action plans for specific tasks; however these approaches often involve human intervention or additional machinery to enable sensor-motor interactions. In this work, we propose a prompting-based strategy for extracting executable plans from an LLM, which leverages a novel and readily-accessible source of information: precondition errors. Our approach assumes that actions are only afforded execution in certain contexts, i.e., implicit preconditions must be met for an action to execute (e.g., a door must be unlocked to open it), and that the embodied agent has the ability to determine if the action is/is not executable in the current context (e.g., detect if a precondition error is present). When an agent is unable to execute an action, our approach re-prompts the LLM with precondition error information to extract an executable corrective action to achieve the intended goal in the current context. We evaluate our approach in the VirtualHome simulation environment on 88 different tasks and 7 scenes. We evaluate different prompt templates and compare to methods that naively re-sample actions from the LLM. Our approach, using precondition errors, improves executability and semantic correctness of plans, while also reducing the number of re-prompts required when querying actions.
翻译:大语言模型(LLMS)中存在的常识知识为设计智能化集成剂提供了一条途径。相关工作向具有广泛背景信息的LLMS询问了具有广泛背景信息的LLMs,如目标、传感器观测和场景描述等,以产生针对具体任务的高级别行动计划;然而,这些方法往往涉及人力干预或额外机制,以促成感官与运动的互动。在这项工作中,我们提议了一个基于快速的策略,从一个LLMM中提取可执行的计划,利用一个新颖和易于获取的信息来源:前提错误。我们的方法假定,行动只有在特定情况下才会得到执行,即必须满足执行行动所需的隐含的先决条件(例如,必须打开一个门以打开它),而且,为具体代理人有能力确定行动在当前背景下是否可执行(例如,检测是否存在一个先决条件错误),当一个代理人无法执行一项行动时,我们的方法是重新编写。我们的方法假定LM的错误信息只有在某些情况下才能执行,即满足执行行动的隐含的先决条件,即必须满足执行行动的隐含的先决条件(例如,门必须打开门打开门打开门打开门打开门以达到预期目标),同时,在不同的虚拟模型上,我们需要对我们的正确的路径进行实地评估。我们使用我们所用的方法,然后对我们的正确性评估。我们所使用的方法进行实地评估。我们所使用的方法,在使用我们所使用的方法进行实地评估。我们所使用的方法,要改进。