Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models - 专知论文

会员服务 ·

0

控制器 · MoDELS · 可约的 · HTTPS · 可理解性 ·

2023 年 5 月 25 日

Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models

翻译：暂无翻译

Shihao Zhao,Dongdong Chen,Yen-Chun Chen,Jianmin Bao,Shaozhe Hao,Lu Yuan,Kwan-Yee K. Wong

from arxiv, Code is available at https://github.com/ShihaoZhaoZSH/Uni-ControlNet

Text-to-Image diffusion models have made tremendous progress over the past two years, enabling the generation of highly realistic images based on open-domain text descriptions. However, despite their success, text descriptions often struggle to adequately convey detailed controls, even when composed of long and complex texts. Moreover, recent studies have also shown that these models face challenges in understanding such complex texts and generating the corresponding images. Therefore, there is a growing need to enable more control modes beyond text description. In this paper, we introduce Uni-ControlNet, a novel approach that allows for the simultaneous utilization of different local controls (e.g., edge maps, depth map, segmentation masks) and global controls (e.g., CLIP image embeddings) in a flexible and composable manner within one model. Unlike existing methods, Uni-ControlNet only requires the fine-tuning of two additional adapters upon frozen pre-trained text-to-image diffusion models, eliminating the huge cost of training from scratch. Moreover, thanks to some dedicated adapter designs, Uni-ControlNet only necessitates a constant number (i.e., 2) of adapters, regardless of the number of local or global controls used. This not only reduces the fine-tuning costs and model size, making it more suitable for real-world deployment, but also facilitate composability of different conditions. Through both quantitative and qualitative comparisons, Uni-ControlNet demonstrates its superiority over existing methods in terms of controllability, generation quality and composability. Code is available at \url{https://github.com/ShihaoZhaoZSH/Uni-ControlNet}.

翻译：暂无翻译

0

相关内容

控制器

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

NKT细胞激活剂，α-半乳糖苷神经酰胺治疗耐药结核杆菌的研究

国家自然科学基金

0+阅读 · 2014年12月31日

REVOLUTA基因在豆科模式植物蒺藜苜蓿复叶发育中的功能研究

国家自然科学基金

0+阅读 · 2013年12月31日

异位灶微环境调节性T细胞的功能调节及其作用机制

国家自然科学基金

0+阅读 · 2012年12月31日

miR-185抑制前列腺癌细胞中雄激素受体的表达及其介导的信号通路的作用研究

国家自然科学基金

0+阅读 · 2011年12月31日

重调和方程基于Poisson算子的高效有限元方法

国家自然科学基金

0+阅读 · 2011年12月31日

HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models

Arxiv

0+阅读 · 2023年7月13日

Domain-Agnostic Tuning-Encoder for Fast Personalization of Text-To-Image Models

Arxiv

0+阅读 · 2023年7月13日

Typology of Risks of Generative Text-to-Image Models

Arxiv

0+阅读 · 2023年7月8日

A Survey on Generative Diffusion Model

Arxiv

46+阅读 · 2022年9月6日

From Show to Tell: A Survey on Image Captioning

Arxiv

15+阅读 · 2021年7月14日

VIP会员

文章信息

相关主题

相关VIP内容

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《利用人工智能改善军事警察行动：当下现状探索》最新95页报告

《用于适应性、任务就绪型军用仿生机器人的合成数据管道》

面向现代武装力量的高级AI驱动军事模拟与训练软件

《军事应用中的AI：建立信任》最新报告

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

相关论文

HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models

Arxiv

0+阅读 · 2023年7月13日

Domain-Agnostic Tuning-Encoder for Fast Personalization of Text-To-Image Models

Arxiv

0+阅读 · 2023年7月13日

Typology of Risks of Generative Text-to-Image Models

Arxiv

0+阅读 · 2023年7月8日

A Survey on Generative Diffusion Model

Arxiv

46+阅读 · 2022年9月6日

From Show to Tell: A Survey on Image Captioning

Arxiv

15+阅读 · 2021年7月14日

相关基金

NKT细胞激活剂，α-半乳糖苷神经酰胺治疗耐药结核杆菌的研究

国家自然科学基金

0+阅读 · 2014年12月31日

REVOLUTA基因在豆科模式植物蒺藜苜蓿复叶发育中的功能研究

国家自然科学基金

0+阅读 · 2013年12月31日

异位灶微环境调节性T细胞的功能调节及其作用机制

国家自然科学基金

0+阅读 · 2012年12月31日

miR-185抑制前列腺癌细胞中雄激素受体的表达及其介导的信号通路的作用研究

国家自然科学基金

0+阅读 · 2011年12月31日

重调和方程基于Poisson算子的高效有限元方法

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员