KNN-Diffusion: Image Generation via Large-Scale Retrieval - Zhuanzhi (专知) Papers

October 2, 2022

KNN-Diffusion: Image Generation via Large-Scale Retrieval

Shelly Sheynin, Oron Ashual, Adam Polyak, Uriel Singer, Oran Gafni, Eliya Nachmani, Yaniv Taigman

Recent text-to-image models have achieved impressive results. However, since they require large-scale datasets of text-image pairs, it is impractical to train them on new domains where data is scarce or not labeled. In this work, we propose using large-scale retrieval methods, in particular, efficient k-Nearest-Neighbors (kNN), which offers novel capabilities: (1) training a substantially small and efficient text-to-image diffusion model without any text, (2) generating out-of-distribution images by simply swapping the retrieval database at inference time, and (3) performing text-driven local semantic manipulations while preserving object identity. To demonstrate the robustness of our method, we apply our kNN approach to two state-of-the-art diffusion backbones, and show results on several different datasets. As evaluated by human studies and automatic metrics, our method achieves state-of-the-art results compared to existing approaches that train text-to-image generation models using images only (without paired text data).
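
The retrieval step that the abstract builds on can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes that queries and database entries are vectors in one shared embedding space, uses random vectors as stand-ins for real embeddings, and uses exact brute-force cosine search rather than an efficient approximate index. The names `l2_normalize`, `knn_retrieve`, `db_a`, and `db_b` are hypothetical.

```python
import numpy as np


def l2_normalize(x: np.ndarray) -> np.ndarray:
    """Scale vectors to unit length so dot products equal cosine similarity."""
    return x / np.linalg.norm(x, axis=-1, keepdims=True)


def knn_retrieve(query: np.ndarray, database: np.ndarray, k: int) -> np.ndarray:
    """Return indices of the k database rows most similar to the query, best first."""
    sims = l2_normalize(database) @ l2_normalize(query)
    # argpartition finds the k largest similarities without a full sort.
    top_k = np.argpartition(-sims, k - 1)[:k]
    # Order only those k candidates by decreasing similarity.
    return top_k[np.argsort(-sims[top_k])]


# Stand-in data (assumption): random vectors instead of real image embeddings.
rng = np.random.default_rng(0)
dim = 8
db_a = rng.normal(size=(100, dim))  # hypothetical retrieval database A
db_b = rng.normal(size=(50, dim))   # hypothetical database B from another domain

# A query lying very close to row 3 of database A.
query = db_a[3] + 0.01 * rng.normal(size=dim)

# Same query, two databases: only the database argument changes.
neighbors_a = knn_retrieve(query, db_a, k=5)
neighbors_b = knn_retrieve(query, db_b, k=5)

# In a retrieval-conditioned generator, the retrieved rows would be handed to
# the model as extra conditioning (assumption; not shown here).
conditioning_a = db_a[neighbors_a]
conditioning_b = db_b[neighbors_b]
```

Because the database is just an argument to the search, pointing the same query at `db_b` instead of `db_a` changes the retrieved neighbors without touching any model weights, which is the sense in which a retrieval database can be swapped at inference time.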

