A Neural Space-Time Representation for Text-to-Image Personalization - 专知论文

会员服务 ·

0

逼真度 · 输出 · Learning · 表示 · Processing（编程语言） ·

2023 年 5 月 24 日

A Neural Space-Time Representation for Text-to-Image Personalization

翻译：暂无翻译

Yuval Alaluf,Elad Richardson,Gal Metzer,Daniel Cohen-Or

from arxiv, Project page available at https://neuraltextualinversion.github.io/NeTI/

A key aspect of text-to-image personalization methods is the manner in which the target concept is represented within the generative process. This choice greatly affects the visual fidelity, downstream editability, and disk space needed to store the learned concept. In this paper, we explore a new text-conditioning space that is dependent on both the denoising process timestep (time) and the denoising U-Net layers (space) and showcase its compelling properties. A single concept in the space-time representation is composed of hundreds of vectors, one for each combination of time and space, making this space challenging to optimize directly. Instead, we propose to implicitly represent a concept in this space by optimizing a small neural mapper that receives the current time and space parameters and outputs the matching token embedding. In doing so, the entire personalized concept is represented by the parameters of the learned mapper, resulting in a compact, yet expressive, representation. Similarly to other personalization methods, the output of our neural mapper resides in the input space of the text encoder. We observe that one can significantly improve the convergence and visual fidelity of the concept by introducing a textual bypass, where our neural mapper additionally outputs a residual that is added to the output of the text encoder. Finally, we show how one can impose an importance-based ordering over our implicit representation, providing users control over the reconstruction and editability of the learned concept using a single trained model. We demonstrate the effectiveness of our approach over a range of concepts and prompts, showing our method's ability to generate high-quality and controllable compositions without fine-tuning any parameters of the generative model itself.

翻译：暂无翻译

0

相关内容

逼真度

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

GNN 新基准！Long Range Graph Benchmark

GNN 新基准！Long Range Graph Benchmark

图与推荐

0+阅读 · 2022年10月18日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】GAN架构入门综述(资源汇总)

【推荐】GAN架构入门综述(资源汇总)

机器学习研究会

10+阅读 · 2017年9月3日

PARP1通路抑制分子RNF146调控星形胶质细胞凋亡在AD中的作用研究

国家自然科学基金

0+阅读 · 2014年12月31日

CD74调控乳腺癌细胞迁移的分子机制

国家自然科学基金

0+阅读 · 2014年12月31日

纳米抗体靶向型载体介导miR122诱导非小细胞肺癌凋亡的分子机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

高电导率、高倍率性能LiVPO4F/C复合正极材料的结构调控及其电化学性能研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于Diels-Alder反应的可逆交联芳香族聚酰胺及其碳纳米复合材料的制备、表征及性质研究

国家自然科学基金

0+阅读 · 2014年12月31日

CD100阳性NK细胞上调Plexin-B1影响胶质瘤细胞凋亡和侵袭中的分子信号机制

国家自然科学基金

0+阅读 · 2012年12月31日

长非编码RNA BC032469调控胃癌细胞hTERT表达的分子机制研究

国家自然科学基金

1+阅读 · 2012年12月31日

行为驱动、学习效应嵌入的TFT-LCD调度建模及智能优化研究-基于有限理性及中国情景的视角

国家自然科学基金

0+阅读 · 2012年12月31日

胶质瘤中多途径介导的miR-128调控HIF-1信号通路的分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

Period2基因调控人胶质瘤细胞凋亡的分子机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

Joint Hierarchical Priors and Adaptive Spatial Resolution for Efficient Neural Image Compression

Arxiv

0+阅读 · 2023年7月12日

Towards Safe Self-Distillation of Internet-Scale Text-to-Image Diffusion Models

Arxiv

0+阅读 · 2023年7月12日

Sem-CS: Semantic CLIPStyler for Text-Based Image Style Transfer

Arxiv

0+阅读 · 2023年7月12日

RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths

Arxiv

0+阅读 · 2023年7月11日

SITTA: A Semantic Image-Text Alignment for Image Captioning

Arxiv

1+阅读 · 2023年7月10日

AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning

Arxiv

0+阅读 · 2023年7月10日

NeuSE: Neural SE(3)-Equivariant Embedding for Consistent Spatial Understanding with Objects

Arxiv

0+阅读 · 2023年7月10日

Generative Models as a Data Source for Multiview Representation Learning

Arxiv

16+阅读 · 2021年6月9日

Z-GCNETs: Time Zigzags at Graph Convolutional Networks for Time Series Forecasting

Arxiv

10+阅读 · 2021年5月10日

Financial Time Series Representation Learning

Financial Time Series Representation Learning

Arxiv

10+阅读 · 2020年3月27日

VIP会员

文章信息

相关主题

Processing（编程语言）

相关VIP内容

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】多目标奖励与偏好优化：理论与算法

《无形的防御者？将定向能武器集成到反无人机框架的机遇与挑战》报告

自主化海军：海上无人系统与未来海战

迈向智能体系统规模化的科学

相关资讯

GNN 新基准！Long Range Graph Benchmark

GNN 新基准！Long Range Graph Benchmark

图与推荐

0+阅读 · 2022年10月18日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】GAN架构入门综述(资源汇总)

【推荐】GAN架构入门综述(资源汇总)

机器学习研究会

10+阅读 · 2017年9月3日

相关论文

Joint Hierarchical Priors and Adaptive Spatial Resolution for Efficient Neural Image Compression

Arxiv

0+阅读 · 2023年7月12日

Towards Safe Self-Distillation of Internet-Scale Text-to-Image Diffusion Models

Arxiv

0+阅读 · 2023年7月12日

Sem-CS: Semantic CLIPStyler for Text-Based Image Style Transfer

Arxiv

0+阅读 · 2023年7月12日

RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths

Arxiv

0+阅读 · 2023年7月11日

SITTA: A Semantic Image-Text Alignment for Image Captioning

Arxiv

1+阅读 · 2023年7月10日

AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning

Arxiv

0+阅读 · 2023年7月10日

NeuSE: Neural SE(3)-Equivariant Embedding for Consistent Spatial Understanding with Objects

Arxiv

0+阅读 · 2023年7月10日

Generative Models as a Data Source for Multiview Representation Learning

Arxiv

16+阅读 · 2021年6月9日

Z-GCNETs: Time Zigzags at Graph Convolutional Networks for Time Series Forecasting

Arxiv

10+阅读 · 2021年5月10日

Financial Time Series Representation Learning

Financial Time Series Representation Learning

Arxiv

10+阅读 · 2020年3月27日

相关基金

PARP1通路抑制分子RNF146调控星形胶质细胞凋亡在AD中的作用研究

国家自然科学基金

0+阅读 · 2014年12月31日

CD74调控乳腺癌细胞迁移的分子机制

国家自然科学基金

0+阅读 · 2014年12月31日

纳米抗体靶向型载体介导miR122诱导非小细胞肺癌凋亡的分子机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

高电导率、高倍率性能LiVPO4F/C复合正极材料的结构调控及其电化学性能研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于Diels-Alder反应的可逆交联芳香族聚酰胺及其碳纳米复合材料的制备、表征及性质研究

国家自然科学基金

0+阅读 · 2014年12月31日

CD100阳性NK细胞上调Plexin-B1影响胶质瘤细胞凋亡和侵袭中的分子信号机制

国家自然科学基金

0+阅读 · 2012年12月31日

长非编码RNA BC032469调控胃癌细胞hTERT表达的分子机制研究

国家自然科学基金

1+阅读 · 2012年12月31日

行为驱动、学习效应嵌入的TFT-LCD调度建模及智能优化研究-基于有限理性及中国情景的视角

国家自然科学基金

0+阅读 · 2012年12月31日

胶质瘤中多途径介导的miR-128调控HIF-1信号通路的分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

Period2基因调控人胶质瘤细胞凋亡的分子机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员