中文本2laent: 使用拆分扩散和 CLIP 校准StyleGAN 校准前StyleGAN 的文本驱动取样 (clip2latent: Text driven sampling of a pre-trained StyleGAN using denoising diffusion and CLIP) - 专知论文

会员服务 ·

0

StyleGAN · MoDELS · 样本 · 去噪 · 控制器 ·

2022 年 10 月 5 日

clip2latent: Text driven sampling of a pre-trained StyleGAN using denoising diffusion and CLIP

翻译：中文本2laent: 使用拆分扩散和 CLIP 校准StyleGAN 校准前StyleGAN 的文本驱动取样

Justin N. M. Pinkney,Chuan Li

from arxiv, Accepted to BMVC 2022

We introduce a new method to efficiently create text-to-image models from a pre-trained CLIP and StyleGAN. It enables text driven sampling with an existing generative model without any external data or fine-tuning. This is achieved by training a diffusion model conditioned on CLIP embeddings to sample latent vectors of a pre-trained StyleGAN, which we call clip2latent. We leverage the alignment between CLIP's image and text embeddings to avoid the need for any text labelled data for training the conditional diffusion model. We demonstrate that clip2latent allows us to generate high-resolution (1024x1024 pixels) images based on text prompts with fast sampling, high image quality, and low training compute and data requirements. We also show that the use of the well studied StyleGAN architecture, without further fine-tuning, allows us to directly apply existing methods to control and modify the generated images adding a further layer of control to our text-to-image pipeline.

翻译：我们引入了一种新方法, 高效地从训练有素的 CLIP 和 StyleGAN 中创建文本到图像模型。它使得以文本驱动的取样能够以现有的基因模型进行,而无需任何外部数据或微调。这是通过培训一个以CLIP嵌入预先培训的StyleGAN 的潜在矢量为条件的传播模型为条件的传播模型为条件的传播模型的传播模型, 我们称之为剪动。我们利用CLIP 图像和文本嵌入之间的匹配, 以避免需要任何标记的文本数据来培训有条件的传播模型。我们证明剪动能让我们生成基于文本提示的高分辨率( 1024x1024 像素) 图像, 以快速取样、高图像质量、低培训计算和数据要求的文本提示为基础。我们还表明, 使用经过良好研究的StyGAN 结构, 无需进一步微调, 就能直接应用现有方法来控制和修改生成的图像, 给我们的文本到图像管道增加一层控制。

0

相关内容

StyleGAN

【CVPR 2022】多模态视频字幕的端到端生成预训练，End-to-end Generative Pretraining for Multimodal Video Captioning

【CVPR 2022】多模态视频字幕的端到端生成预训练，End-to-end Generative Pretraining for Multimodal Video Captioning

专知会员服务

27+阅读 · 2022年3月3日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

专知会员服务

108+阅读 · 2020年5月1日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

专知会员服务

50+阅读 · 2020年2月26日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

化疗诱导的细胞衰老在神经母细胞瘤复发中的作用及分子机制

国家自然科学基金

0+阅读 · 2014年12月31日

PPAR β/δ基因在结直肠癌血管生成调控中的作用及分子机理

国家自然科学基金

2+阅读 · 2014年12月31日

CCN6通过IGF-1通路调节软骨基质代谢平衡的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

中短波紫外线对采后番茄果实酚类代谢的调控机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

纤维结构不良中破骨细胞过度激活的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

ADAMTS1改变细胞微环境对脂肪细胞定向的作用及机制

国家自然科学基金

0+阅读 · 2012年12月31日

IRF1在诱导分化过程中全基因组水平调控机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

相互作用暗能量模型

国家自然科学基金

0+阅读 · 2011年12月31日

前列腺癌转移抑制基因CRMP4及其调控机制的研究

国家自然科学基金

0+阅读 · 2009年12月31日

miR146a抑制Smad4对骨髓间充质干细胞成骨分化调控的研究

国家自然科学基金

0+阅读 · 2009年12月31日

Masked Supervised Learning for Semantic Segmentation

Arxiv

0+阅读 · 2022年11月8日

eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers

Arxiv

0+阅读 · 2022年11月8日

Collaboration of Pre-trained Models Makes Better Few-shot Learner

Arxiv

0+阅读 · 2022年11月7日

Few-shot Image Generation with Diffusion Models

Arxiv

0+阅读 · 2022年11月7日

eDiffi: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers

Arxiv

0+阅读 · 2022年11月6日

A Survey on Generative Diffusion Model

Arxiv

46+阅读 · 2022年9月6日

Prompt Distribution Learning

Arxiv

14+阅读 · 2022年5月6日

Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing

Arxiv

30+阅读 · 2021年7月28日

Making Pre-trained Language Models Better Few-shot Learners

Arxiv

14+阅读 · 2020年12月31日

Data Augmentation using Pre-trained Transformer Models

Arxiv

17+阅读 · 2020年3月4日

VIP会员

文章信息

相关主题

相关VIP内容

【CVPR 2022】多模态视频字幕的端到端生成预训练，End-to-end Generative Pretraining for Multimodal Video Captioning

【CVPR 2022】多模态视频字幕的端到端生成预训练，End-to-end Generative Pretraining for Multimodal Video Captioning

专知会员服务

27+阅读 · 2022年3月3日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

专知会员服务

108+阅读 · 2020年5月1日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

专知会员服务

50+阅读 · 2020年2月26日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【伯克利博士论文】通过真实世界实践赋能机器人自主性

军用无人机集群技术尚未成熟——但潜力可期

人工智能安全治理白皮书（2025）

AgentOps综述：分类、挑战与未来方向

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

相关论文

Masked Supervised Learning for Semantic Segmentation

Arxiv

0+阅读 · 2022年11月8日

eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers

Arxiv

0+阅读 · 2022年11月8日

Collaboration of Pre-trained Models Makes Better Few-shot Learner

Arxiv

0+阅读 · 2022年11月7日

Few-shot Image Generation with Diffusion Models

Arxiv

0+阅读 · 2022年11月7日

eDiffi: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers

Arxiv

0+阅读 · 2022年11月6日

A Survey on Generative Diffusion Model

Arxiv

46+阅读 · 2022年9月6日

Prompt Distribution Learning

Arxiv

14+阅读 · 2022年5月6日

Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing

Arxiv

30+阅读 · 2021年7月28日

Making Pre-trained Language Models Better Few-shot Learners

Arxiv

14+阅读 · 2020年12月31日

Data Augmentation using Pre-trained Transformer Models

Arxiv

17+阅读 · 2020年3月4日

相关基金

化疗诱导的细胞衰老在神经母细胞瘤复发中的作用及分子机制

国家自然科学基金

0+阅读 · 2014年12月31日

PPAR β/δ基因在结直肠癌血管生成调控中的作用及分子机理

国家自然科学基金

2+阅读 · 2014年12月31日

CCN6通过IGF-1通路调节软骨基质代谢平衡的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

中短波紫外线对采后番茄果实酚类代谢的调控机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

纤维结构不良中破骨细胞过度激活的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

ADAMTS1改变细胞微环境对脂肪细胞定向的作用及机制

国家自然科学基金

0+阅读 · 2012年12月31日

IRF1在诱导分化过程中全基因组水平调控机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

相互作用暗能量模型

国家自然科学基金

0+阅读 · 2011年12月31日

前列腺癌转移抑制基因CRMP4及其调控机制的研究

国家自然科学基金

0+阅读 · 2009年12月31日

miR146a抑制Smad4对骨髓间充质干细胞成骨分化调控的研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员