多种传播:为受控图像生成提供扩散路径 (MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation) - 专知论文

会员服务 ·

0

控制器 · Processing（编程语言） · MoDELS · 讲稿 · HTTPS ·

2023 年 2 月 16 日

MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation

翻译：多种传播:为受控图像生成提供扩散路径

Omer Bar-Tal,Lior Yariv,Yaron Lipman,Tali Dekel

Recent advances in text-to-image generation with diffusion models present transformative capabilities in image quality. However, user controllability of the generated image, and fast adaptation to new tasks still remains an open challenge, currently mostly addressed by costly and long re-training and fine-tuning or ad-hoc adaptations to specific image generation tasks. In this work, we present MultiDiffusion, a unified framework that enables versatile and controllable image generation, using a pre-trained text-to-image diffusion model, without any further training or finetuning. At the center of our approach is a new generation process, based on an optimization task that binds together multiple diffusion generation processes with a shared set of parameters or constraints. We show that MultiDiffusion can be readily applied to generate high quality and diverse images that adhere to user-provided controls, such as desired aspect ratio (e.g., panorama), and spatial guiding signals, ranging from tight segmentation masks to bounding boxes. Project webpage: https://multidiffusion.github.io

翻译：以传播模型制作文本到图像方面的最新进展体现了图像质量的变革能力。然而,生成图像的用户可控制性以及快速适应新任务仍然是一项公开的挑战,目前,主要通过对特定图像生成任务进行昂贵和长期的再培训和微调或特别调整来解决。在这项工作中,我们介绍了多发化,这是一个统一框架,能够使用预先培训的文本到图像扩散模型,进行多功能和可控的图像生成,无需任何进一步的培训或微调。我们方法的核心是一个新一代过程,其基础是优化工作,将多个扩散生成进程与一套共同参数或制约因素结合在一起。我们表明,多发化可以很容易地用于生成高质量和多样的图像,以遵守用户提供的控制,例如理想的侧比(例如,全景),以及空间指导信号,从紧紧的分解面罩到捆绑盒。项目网页:https://multidifulation.ghub.github.io。

0

相关内容

控制器

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

专知会员服务

50+阅读 · 2020年2月26日

【Google ICLR2020论文】嵌入式大规模检索的预训练任务，Pre-training Tasks for Embedding-based Large-scale Retrieval

【Google ICLR2020论文】嵌入式大规模检索的预训练任务，Pre-training Tasks for Embedding-based Large-scale Retrieval

专知会员服务

28+阅读 · 2020年2月12日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

专知

27+阅读 · 2018年2月7日

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

专知

66+阅读 · 2018年1月31日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

肺炎支原体外排泵ABC Transporter在大环内酯类耐药中的作用机制研究

国家自然科学基金

0+阅读 · 2016年12月31日

靶向抑制 MNK-eIF4E 轴增效TRAIL治疗鼻咽癌的机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

Kronheimer-Nakajima quiver 模空间与有理曲面

国家自然科学基金

1+阅读 · 2013年12月31日

聚磷酸盐激酶1在奇异变形杆菌致尿路感染中的作用及其机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

miRNA在补体介导甲型H1N1流感肺部炎症损伤中的作用及调控机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

流行性乙型脑炎病毒感染相关宿主miRNA的鉴定与功能分析

国家自然科学基金

0+阅读 · 2012年12月31日

LIMK1：罗格列酮抑制人胃癌细胞增殖、迁移及侵袭的作用靶点

国家自然科学基金

0+阅读 · 2012年12月31日

TIP30核内化的分子机制及其与EGFR信号通路的相关性研究

国家自然科学基金

0+阅读 · 2012年12月31日

化疗药物诱导大肠癌上皮间质转化过程中PrPc-STAT3通路的作用及机制

国家自然科学基金

0+阅读 · 2011年12月31日

新疆维吾尔族肾虚血瘀型耳聋与线粒体基因多态性的相关性研究

国家自然科学基金

0+阅读 · 2008年12月31日

Ambiguous Medical Image Segmentation using Diffusion Models

Arxiv

2+阅读 · 2023年4月10日

BerDiff: Conditional Bernoulli Diffusion Model for Medical Image Segmentation

Arxiv

1+阅读 · 2023年4月10日

Towards Real-time Text-driven Image Manipulation with Unconditional Diffusion Models

Arxiv

0+阅读 · 2023年4月10日

Split, Merge, and Refine: Fitting Tight Bounding Boxes via Learned Over-Segmentation and Iterative Search

Arxiv

0+阅读 · 2023年4月10日

ShapeCrafter: A Recursive Text-Conditioned 3D Shape Generation Model

Arxiv

0+阅读 · 2023年4月8日

InstantBooth: Personalized Text-to-Image Generation without Test-Time Finetuning

Arxiv

0+阅读 · 2023年4月6日

InterFormer: Real-time Interactive Image Segmentation

Arxiv

0+阅读 · 2023年4月6日

Taming Encoder for Zero Fine-tuning Image Customization with Text-to-Image Diffusion Models

Arxiv

0+阅读 · 2023年4月5日

Highly Personalized Text Embedding for Image Manipulation by Stable Diffusion

Arxiv

0+阅读 · 2023年4月5日

Cross-modulated Few-shot Image Generation for Colorectal Tissue Classification

Arxiv

0+阅读 · 2023年4月4日

VIP会员

文章信息

相关主题

Processing（编程语言）

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

专知会员服务

50+阅读 · 2020年2月26日

【Google ICLR2020论文】嵌入式大规模检索的预训练任务，Pre-training Tasks for Embedding-based Large-scale Retrieval

【Google ICLR2020论文】嵌入式大规模检索的预训练任务，Pre-training Tasks for Embedding-based Large-scale Retrieval

专知会员服务

28+阅读 · 2020年2月12日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】低维与高维空间中潜在表征的分析、建模与变换

《生态建模密码破译：建模与编程实践》美陆军最新报告

大模型解决方案白皮书：社交陪伴场景全流程落地指南

面向具身操作的视觉-语言-动作模型综述

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

专知

27+阅读 · 2018年2月7日

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

专知

66+阅读 · 2018年1月31日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

相关论文

Ambiguous Medical Image Segmentation using Diffusion Models

Arxiv

2+阅读 · 2023年4月10日

BerDiff: Conditional Bernoulli Diffusion Model for Medical Image Segmentation

Arxiv

1+阅读 · 2023年4月10日

Towards Real-time Text-driven Image Manipulation with Unconditional Diffusion Models

Arxiv

0+阅读 · 2023年4月10日

Split, Merge, and Refine: Fitting Tight Bounding Boxes via Learned Over-Segmentation and Iterative Search

Arxiv

0+阅读 · 2023年4月10日

ShapeCrafter: A Recursive Text-Conditioned 3D Shape Generation Model

Arxiv

0+阅读 · 2023年4月8日

InstantBooth: Personalized Text-to-Image Generation without Test-Time Finetuning

Arxiv

0+阅读 · 2023年4月6日

InterFormer: Real-time Interactive Image Segmentation

Arxiv

0+阅读 · 2023年4月6日

Taming Encoder for Zero Fine-tuning Image Customization with Text-to-Image Diffusion Models

Arxiv

0+阅读 · 2023年4月5日

Highly Personalized Text Embedding for Image Manipulation by Stable Diffusion

Arxiv

0+阅读 · 2023年4月5日

Cross-modulated Few-shot Image Generation for Colorectal Tissue Classification

Arxiv

0+阅读 · 2023年4月4日

相关基金

肺炎支原体外排泵ABC Transporter在大环内酯类耐药中的作用机制研究

国家自然科学基金

0+阅读 · 2016年12月31日

靶向抑制 MNK-eIF4E 轴增效TRAIL治疗鼻咽癌的机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

Kronheimer-Nakajima quiver 模空间与有理曲面

国家自然科学基金

1+阅读 · 2013年12月31日

聚磷酸盐激酶1在奇异变形杆菌致尿路感染中的作用及其机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

miRNA在补体介导甲型H1N1流感肺部炎症损伤中的作用及调控机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

流行性乙型脑炎病毒感染相关宿主miRNA的鉴定与功能分析

国家自然科学基金

0+阅读 · 2012年12月31日

LIMK1：罗格列酮抑制人胃癌细胞增殖、迁移及侵袭的作用靶点

国家自然科学基金

0+阅读 · 2012年12月31日

TIP30核内化的分子机制及其与EGFR信号通路的相关性研究

国家自然科学基金

0+阅读 · 2012年12月31日

化疗药物诱导大肠癌上皮间质转化过程中PrPc-STAT3通路的作用及机制

国家自然科学基金

0+阅读 · 2011年12月31日

新疆维吾尔族肾虚血瘀型耳聋与线粒体基因多态性的相关性研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员