零3D: 语义驱动多类3D形状生成 (Zero3D: Semantic-Driven Multi-Category 3D Shape Generation) - 专知论文

会员服务 ·

0

塑造 · 3D · MoDELS · Pair · 向量化 ·

2023 年 1 月 31 日

Zero3D: Semantic-Driven Multi-Category 3D Shape Generation

翻译：零3D: 语义驱动多类3D形状生成

Bo Han,Yitong Liu,Yixuan Shen

Semantic-driven 3D shape generation aims to generate 3D objects conditioned on text. Previous works face problems with single-category generation, low-frequency 3D details, and requiring a large number of paired datasets for training. To tackle these challenges, we propose a multi-category conditional diffusion model. Specifically, 1) to alleviate the problem of lack of large-scale paired data, we bridge the text, 2D image and 3D shape based on the pre-trained CLIP model, and 2) to obtain the multi-category 3D shape feature, we apply the conditional flow model to generate 3D shape vector conditioned on CLIP embedding. 3) to generate multi-category 3D shape, we employ the hidden-layer diffusion model conditioned on the multi-category shape vector, which greatly reduces the training time and memory consumption.

翻译：以语义驱动的 3D 形状生成旨在生成以文本为条件的 3D 对象。以往的工作面临单类生成、低频 3D 细节和需要大量配对数据集的培训问题。为了应对这些挑战, 我们提议了一个多类有条件的扩展模式。具体地说, 1) 缓解缺少大型配对数据的问题, 我们根据经过预先培训的 CLIP 模型将文本、 2D 图像和 3D 形状连接起来, 2) 获取多类 3D 形状特征, 我们使用有条件的流程模式生成以 CLIP 嵌入为条件的 3D 形状矢量。 3) 生成多类 3D 形状, 我们使用以多类形状矢量为条件的隐藏层扩散模式, 这极大地减少了培训时间和记忆消耗。

0

相关内容

【Hugging Face】使用自定义数据集微调语义分割模型，Fine-Tune a Semantic Segmentation Model with a Custom Dataset

【Hugging Face】使用自定义数据集微调语义分割模型，Fine-Tune a Semantic Segmentation Model with a Custom Dataset

专知会员服务

21+阅读 · 2022年3月18日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

CVPR 2019 | 34篇 CVPR 2019 论文实现代码

CVPR 2019 | 34篇 CVPR 2019 论文实现代码

AI科技评论

21+阅读 · 2019年6月23日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

TorchSeg：基于pytorch的语义分割算法开源了

TorchSeg：基于pytorch的语义分割算法开源了

极市平台

20+阅读 · 2019年1月28日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【推荐】深度学习目标检测全面综述

【推荐】深度学习目标检测全面综述

机器学习研究会

21+阅读 · 2017年9月13日

LncRNA-TC0101441抑制KiSS-1促进卵巢癌侵袭转移的作用及分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

酒酒球菌SD-2a抗逆相关基因表达模式和功能的研究

国家自然科学基金

0+阅读 · 2014年12月31日

β-catenin/Ets1复合体在胶质母细胞瘤中对hTERT表达调控机制的研究

国家自然科学基金

0+阅读 · 2013年12月31日

表面粗糙元形状对高超声速边界层稳定性和转捩的影响

国家自然科学基金

0+阅读 · 2013年12月31日

PTBP1介导的survivinΔEx3过表达调控胶质母细胞瘤微血管增生的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

胶质瘤干细胞的特异性核酸适体筛选及其在胶质瘤靶向药物治疗中的应用

国家自然科学基金

0+阅读 · 2012年12月31日

多酚与糖蛋白相互作用与其聚集稳定性关系

国家自然科学基金

0+阅读 · 2012年12月31日

新型端粒酶TERT抑制剂：手性吡唑-香豆素-色酮新骨架的优化设计合成及构效关系

国家自然科学基金

0+阅读 · 2012年12月31日

Period2基因调控人胶质瘤细胞凋亡的分子机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

金属巯基配合物的阴离子识别传感研究

国家自然科学基金

0+阅读 · 2009年12月31日

Set-the-Scene: Global-Local Training for Generating Controllable NeRF Scenes

Arxiv

0+阅读 · 2023年3月23日

Medical diffusion on a budget: textual inversion for medical image generation

Arxiv

0+阅读 · 2023年3月23日

TAPS3D: Text-Guided 3D Textured Shape Generation from Pseudo Supervision

Arxiv

0+阅读 · 2023年3月23日

CA$^2$T-Net: Category-Agnostic 3D Articulation Transfer from Single Image

Arxiv

0+阅读 · 2023年3月22日

VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation

Arxiv

0+阅读 · 2023年3月22日

CC3D: Layout-Conditioned Generation of Compositional 3D Scenes

Arxiv

0+阅读 · 2023年3月21日

Text2Room: Extracting Textured 3D Meshes from 2D Text-to-Image Models

Arxiv

0+阅读 · 2023年3月21日

Prime Sample Attention in Object Detection

Arxiv

13+阅读 · 2019年4月9日

A 3D Coarse-to-Fine Framework for Volumetric Medical Image Segmentation

A 3D Coarse-to-Fine Framework for Volumetric Medical Image Segmentation

Arxiv

15+阅读 · 2018年8月2日

An application of cascaded 3D fully convolutional networks for medical image segmentation

Arxiv

10+阅读 · 2018年3月20日

VIP会员

文章信息

相关主题

相关VIP内容

【Hugging Face】使用自定义数据集微调语义分割模型，Fine-Tune a Semantic Segmentation Model with a Custom Dataset

【Hugging Face】使用自定义数据集微调语义分割模型，Fine-Tune a Semantic Segmentation Model with a Custom Dataset

专知会员服务

21+阅读 · 2022年3月18日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

网络科学赋能人工智能: 现状与展望

【NeurIPS2025教程】解释人工智能模型：可解释人工智能、数据中心人工智能与机制可解释性的方法与机遇

人工智能赋能作战行动：以俄乌战争为例

【ETHZ博士论文】表征学习在推进深度学习中的作用：效率、可扩展性与推理

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

CVPR 2019 | 34篇 CVPR 2019 论文实现代码

CVPR 2019 | 34篇 CVPR 2019 论文实现代码

AI科技评论

21+阅读 · 2019年6月23日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

TorchSeg：基于pytorch的语义分割算法开源了

TorchSeg：基于pytorch的语义分割算法开源了

极市平台

20+阅读 · 2019年1月28日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【推荐】深度学习目标检测全面综述

【推荐】深度学习目标检测全面综述

机器学习研究会

21+阅读 · 2017年9月13日

相关论文

Set-the-Scene: Global-Local Training for Generating Controllable NeRF Scenes

Arxiv

0+阅读 · 2023年3月23日

Medical diffusion on a budget: textual inversion for medical image generation

Arxiv

0+阅读 · 2023年3月23日

TAPS3D: Text-Guided 3D Textured Shape Generation from Pseudo Supervision

Arxiv

0+阅读 · 2023年3月23日

CA$^2$T-Net: Category-Agnostic 3D Articulation Transfer from Single Image

Arxiv

0+阅读 · 2023年3月22日

VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation

Arxiv

0+阅读 · 2023年3月22日

CC3D: Layout-Conditioned Generation of Compositional 3D Scenes

Arxiv

0+阅读 · 2023年3月21日

Text2Room: Extracting Textured 3D Meshes from 2D Text-to-Image Models

Arxiv

0+阅读 · 2023年3月21日

Prime Sample Attention in Object Detection

Arxiv

13+阅读 · 2019年4月9日

A 3D Coarse-to-Fine Framework for Volumetric Medical Image Segmentation

A 3D Coarse-to-Fine Framework for Volumetric Medical Image Segmentation

Arxiv

15+阅读 · 2018年8月2日

An application of cascaded 3D fully convolutional networks for medical image segmentation

Arxiv

10+阅读 · 2018年3月20日

相关基金

LncRNA-TC0101441抑制KiSS-1促进卵巢癌侵袭转移的作用及分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

酒酒球菌SD-2a抗逆相关基因表达模式和功能的研究

国家自然科学基金

0+阅读 · 2014年12月31日

β-catenin/Ets1复合体在胶质母细胞瘤中对hTERT表达调控机制的研究

国家自然科学基金

0+阅读 · 2013年12月31日

表面粗糙元形状对高超声速边界层稳定性和转捩的影响

国家自然科学基金

0+阅读 · 2013年12月31日

PTBP1介导的survivinΔEx3过表达调控胶质母细胞瘤微血管增生的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

胶质瘤干细胞的特异性核酸适体筛选及其在胶质瘤靶向药物治疗中的应用

国家自然科学基金

0+阅读 · 2012年12月31日

多酚与糖蛋白相互作用与其聚集稳定性关系

国家自然科学基金

0+阅读 · 2012年12月31日

新型端粒酶TERT抑制剂：手性吡唑-香豆素-色酮新骨架的优化设计合成及构效关系

国家自然科学基金

0+阅读 · 2012年12月31日

Period2基因调控人胶质瘤细胞凋亡的分子机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

金属巯基配合物的阴离子识别传感研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员