Text-guided diffusion models have shown superior performance in image and video generation and editing, yet few explorations have been performed in 3D scenarios. In this paper, we discuss three fundamental and interesting problems on this topic. First, we equip text-guided diffusion models to achieve \textbf{3D-consistent generation}. Specifically, we integrate a NeRF-like neural field to generate low-resolution coarse results for a given camera view. These results provide 3D priors as conditioning information for the subsequent diffusion process. During denoising diffusion, we further enhance 3D consistency by modeling cross-view correspondences with a novel two-stream (corresponding to two different views) asynchronous diffusion process. Second, we study \textbf{3D local editing} and propose a two-step solution that generates 360$^{\circ}$ manipulated results by editing an object from a single view. In Step 1, we perform 2D local editing by blending the predicted noises. In Step 2, we conduct a noise-to-text inversion process that maps the 2D blended noises into the view-independent text embedding space. Once the corresponding text embedding is obtained, 360$^{\circ}$ images can be generated. Last but not least, we extend our model to perform \textbf{one-shot novel view synthesis} by fine-tuning on a single image, showing for the first time the potential of leveraging text guidance for novel view synthesis. Extensive experiments and various applications show the prowess of our 3DDesigner. The project page is available at https://3ddesigner-diffusion.github.io/.
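To make the noise blending in Step 1 concrete, a minimal formalization (our illustrative notation and assumptions, not necessarily the paper's exact formulation) is
\[
\hat{\epsilon}_t \;=\; m \odot \epsilon_\theta\!\left(x_t, t, c_{\text{edit}}\right) \;+\; \left(1 - m\right) \odot \epsilon_\theta\!\left(x_t, t, c_{\text{src}}\right),
\]
where $m$ denotes a binary mask of the edited region, $\epsilon_\theta$ is the text-conditioned denoiser, and $c_{\text{edit}}$, $c_{\text{src}}$ are the edited and original text conditions; the blended noise $\hat{\epsilon}_t$ then drives the standard reverse diffusion update, keeping the edit local to the masked region.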