Language is one of the primary means by which we describe the 3D world around us. While rapid progress has been made in text-to-2D-image synthesis, similar progress in text-to-3D-shape synthesis has been hindered by the lack of paired (text, shape) data. Moreover, extant methods for text-to-shape generation offer limited shape diversity and fidelity. We introduce TextCraft, a method that addresses these limitations by producing high-fidelity and diverse 3D shapes without the need for (text, shape) pairs during training. TextCraft achieves this by leveraging CLIP and a multi-resolution approach: it first generates in a low-dimensional latent space and then upscales to a higher resolution, improving the fidelity of the generated shapes. To improve shape diversity, we use a discrete latent space, which is modelled using a bidirectional transformer conditioned on the interchangeable image-text embedding space induced by CLIP. Moreover, we present a novel variant of classifier-free guidance that further improves the accuracy-diversity trade-off. Finally, we perform extensive experiments demonstrating that TextCraft outperforms state-of-the-art baselines.
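The abstract does not spell out the guidance variant, but for context, the standard classifier-free guidance rule applied to a transformer that predicts discrete latent tokens takes the following form. This is a minimal sketch under assumed names (`model`, `clip_emb`, `null_emb`, `scale` are illustrative, not the paper's API), and TextCraft's variant modifies this basic scheme:

```python
# Minimal sketch of standard classifier-free guidance on token logits.
# All names here are illustrative assumptions; TextCraft's variant
# builds on (and differs from) this basic rule.

def guided_logits(model, tokens, clip_emb, null_emb, scale: float):
    """Run the bidirectional transformer twice and extrapolate from the
    unconditional prediction toward the conditional one.

    scale = 0 -> unconditional sampling; scale = 1 -> conditional;
    scale > 1 -> sharper adherence to the text, at some cost in diversity.
    """
    logits_cond = model(tokens, cond=clip_emb)    # conditioned on the CLIP embedding
    logits_uncond = model(tokens, cond=null_emb)  # conditioned on a learned null embedding
    return logits_uncond + scale * (logits_cond - logits_uncond)
```

Sweeping `scale` traces out the accuracy-diversity trade-off that the abstract refers to: higher values push samples toward the text condition, lower values preserve more of the prior's diversity.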