Controllable image synthesis models allow creation of diverse images based on text instructions or guidance from a reference image. Recently, denoising diffusion probabilistic models have been shown to generate more realistic imagery than prior methods, and have been successfully demonstrated in unconditional and class-conditional settings. We investigate fine-grained, continuous control of this model class, and introduce a novel unified framework for semantic diffusion guidance, which allows either language or image guidance, or both. Guidance is injected into a pretrained unconditional diffusion model using the gradient of image-text or image matching scores, without re-training the diffusion model. We explore CLIP-based language guidance as well as both content and style-based image guidance in a unified framework. Our text-guided synthesis approach can be applied to datasets without associated text annotations. We conduct experiments on FFHQ and LSUN datasets, and show results on fine-grained text-guided image synthesis, synthesis of images related to a style or content reference image, and examples with both textual and image guidance.
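Below is a minimal sketch, not the authors' released code, of how the language-guidance mechanism described above could be injected into a single reverse-diffusion step. It assumes a frozen, pretrained unconditional DDPM exposing a hypothetical p_mean_variance(x_t, t) interface returning the reverse-step mean and variance, and a CLIP-style model exposing encode_image / encode_text; all helper names and the guidance scale are illustrative assumptions, not the paper's exact implementation.

```python
import torch
import torch.nn.functional as F

def guided_reverse_step(diffusion, clip_model, x_t, t, text_tokens, scale=100.0):
    """One DDPM reverse step with CLIP image-text guidance (sketch, classifier-guidance style)."""
    # Unconditional reverse-step statistics from the frozen diffusion model
    # (hypothetical interface; the diffusion model is never re-trained).
    mean, variance = diffusion.p_mean_variance(x_t, t)

    # Gradient of the image-text matching score with respect to the noisy image x_t.
    with torch.enable_grad():
        x_in = x_t.detach().requires_grad_(True)
        img_emb = F.normalize(clip_model.encode_image(x_in), dim=-1)
        txt_emb = F.normalize(clip_model.encode_text(text_tokens), dim=-1)
        score = (img_emb * txt_emb).sum()          # cosine similarity, summed over the batch
        grad = torch.autograd.grad(score, x_in)[0]

    # Shift the sampling mean toward images that better match the text prompt.
    guided_mean = mean + scale * variance * grad
    noise = torch.randn_like(x_t)
    return guided_mean + variance.sqrt() * noise
```

Image guidance in the same framework would follow the analogous pattern: the text embedding is replaced by (or combined with) an embedding or feature statistics of a reference image, so that the gradient of an image-image matching score steers sampling toward the reference's content or style.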