Recent progress in large language models (LLMs) has demonstrated the ability to learn and leverage Internet-scale knowledge through pre-training with autoregressive models. Unfortunately, applying such models to settings with embodied agents, such as robots, is challenging due to their lack of experience with the physical world, inability to parse non-language observations, and ignorance of rewards or safety constraints that robots may require. On the other hand, language-conditioned robotic policies that learn from interaction data can provide the necessary grounding that allows the agent to be correctly situated in the real world, but such policies are limited by the lack of high-level semantic understanding due to the limited breadth of the interaction data available for training them. Thus, if we want to make use of the semantic knowledge in a language model while still situating it in an embodied setting, we must construct an action sequence that is both likely according to the language model and also realizable according to grounded models of the environment. We frame this as a problem similar to probabilistic filtering: decode a sequence that has high probability under the language model and high probability under a set of grounded model objectives. We demonstrate that this guided decoding strategy is able to solve complex, long-horizon embodied tasks in a robotic setting by leveraging the knowledge of both models. The project's website can be found at grounded-decoding.github.io.
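To make the filtering analogy concrete, here is a minimal sketch of the decoding idea in Python. It assumes two hypothetical interfaces not defined in the abstract: `lm_logprobs(prefix)`, returning per-token log probabilities from the language model, and `grounded_logprobs(prefix, obs)`, returning per-token log probabilities from grounded models (e.g., affordance or safety estimates). The greedy token-level combination shown here is an illustrative assumption, not the paper's exact implementation.

```python
import numpy as np

def grounded_decode(lm_logprobs, grounded_logprobs, obs, vocab,
                    max_steps=20, eos="<eos>"):
    """Greedily decode a sequence that scores well under both models.

    Each step combines the two objectives by summing log probabilities,
    i.e., scoring tokens by the product of the language-model distribution
    and the grounded-model distribution (both interfaces are assumptions).
    """
    prefix = []
    for _ in range(max_steps):
        # Per-token scores under the joint objective.
        scores = lm_logprobs(prefix) + grounded_logprobs(prefix, obs)
        token = vocab[int(np.argmax(scores))]
        if token == eos:
            break
        prefix.append(token)
    return prefix
```

In this sketch, tokens that are fluent but infeasible in the current scene are suppressed by the grounded term, while feasible but semantically irrelevant tokens are suppressed by the language-model term.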