This paper demonstrates how OpenAI's ChatGPT can be used in a few-shot setting to convert natural language instructions into a sequence of executable robot actions. The paper proposes easy-to-customize input prompts for ChatGPT that meet common requirements in practical applications, such as easy integration with robot execution systems and applicability to various environments while minimizing the impact of ChatGPT's token limit. The prompts encourage ChatGPT to output a sequence of predefined robot actions, represent the operating environment in a formalized style, and infer the updated state of the operating environment. Experiments confirmed that the proposed prompts enable ChatGPT to act according to requirements in various environments, and users can adjust ChatGPT's output with natural language feedback for safe and robust operation. The proposed prompts and source code are open-source and publicly available at https://github.com/microsoft/ChatGPT-Robot-Manipulation-Prompts
翻译:本文展示了如何在 Few-shot 设置下使用 OpenAI 的 ChatGPT 将自然语言指令转化为可执行机器人动作序列。本文提供了易于定制的 ChatGPT 输入提示,以满足实际应用中的常见要求,如与机器人执行系统的容易集成和适用于各种环境,同时最大程度地减少 ChatGPT 的令牌限制的影响。这些提示鼓励 ChatGPT 输出一系列预定义的机器人动作、规范化描述操作环境并推断操作环境的更新状态。实验证明,所提供的提示使 ChatGPT 能够根据各种环境的要求执行动作,用户还可以用自然语言反馈调整 ChatGPT 的输出,实现安全和稳健的操作。所提供的提示和源代码是开源的,公开可用于 https://github.com/microsoft/ChatGPT-Robot-Manipulation-Prompts。