推进扩散:用于新物体的语义重新排列的物体中心扩散 (StructDiffusion: Object-Centric Diffusion for Semantic Rearrangement of Novel Objects)

Robots operating in human environments must be able to rearrange objects into semantically-meaningful configurations, even if these objects are previously unseen. In this work, we focus on the problem of building physically-valid structures without step-by-step instructions. We propose StructDiffusion, which combines a diffusion model and an object-centric transformer to construct structures out of a single RGB-D image based on high-level language goals, such as "set the table." Our method shows how diffusion models can be used for complex multi-step 3D planning tasks. StructDiffusion improves success rate on assembling physically-valid structures out of unseen objects by on average 16% over an existing multi-modal transformer model, while allowing us to use one multi-task model to produce a wider range of different structures. We show experiments on held-out objects in both simulation and on real-world rearrangement tasks. For videos and additional results, check out our website: http://weiyuliu.com/StructDiffusion/.

翻译：在人类环境中运行的机器人必须能够将物体重新排列成具有语义意义的配置,即使这些物体以前是看不见的。在这项工作中,我们侧重于在没有一步步指令的情况下建立物理有效结构的问题。我们提议SstructDifulation, 它将一个扩散模型和一个以物体为中心的变压器结合起来, 以基于高层次语言目标( 如“ 设置表格 ” ) 的单个 RGB- D 图像来构造结构。我们的方法显示如何将扩散模型用于复杂的多步骤 3D 规划任务。 StructDifmission 将物理有效结构从不可见物体中收集的成功率平均提高16%, 超过现有的多模式变压器模型, 同时允许我们使用一个多任务模型来产生更广泛的不同结构。我们在模拟和真实世界的重新排列任务中展示对悬停物体的实验。关于视频和其他结果,请查看我们的网站: http://weiuli.com/StructDifvilation/。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

计算机科学课程与视频课件合集，Computer Science courses with video lectures

专知会员服务

37+阅读 · 2022年1月24日

【Google可解释人工智能白皮书】27页pdf，AI Explainability Whitepaper ，Introduction to AI Explanations for AI Platform

专知会员服务

127+阅读 · 2019年12月13日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日