语言驱动图像样式传输 (Language-Driven Image Style Transfer) - 专知论文

会员服务 ·

0

contrastive · 判别器 · Performer · 相关系数 · 相似度 ·

2021 年 6 月 1 日

Language-Driven Image Style Transfer

翻译：语言驱动图像样式传输

Tsu-Jui Fu,Xin Eric Wang,William Yang Wang

Despite having promising results, style transfer, which requires preparing style images in advance, may result in lack of creativity and accessibility. Following human instruction, on the other hand, is the most natural way to perform artistic style transfer that can significantly improve controllability for visual effect applications. We introduce a new task -- language-driven image style transfer (\texttt{LDIST}) -- to manipulate the style of a content image, guided by a text. We propose contrastive language visual artist (CLVA) that learns to extract visual semantics from style instructions and accomplish \texttt{LDIST} by the patch-wise style discriminator. The discriminator considers the correlation between language and patches of style images or transferred results to jointly embed style instructions. CLVA further compares contrastive pairs of content image and style instruction to improve the mutual relativeness between transfer results. The transferred results from the same content image can preserve consistent content structures. Besides, they should present analogous style patterns from style instructions that contain similar visual semantics. The experiments show that our CLVA is effective and achieves superb transferred results on \texttt{LDIST}.

翻译：尽管取得了令人充满希望的结果,但风格转换需要事先制作样式图像,这可能导致缺乏创造力和无障碍性。另一方面,在人类教学之后,艺术风格转换是最自然的方法,可以大大提高视觉效果应用程序的可控性。我们引入了一项新的任务 -- -- 语言驱动图像样式转换(\ textt{LDIST}) -- -- 以文本为指导,操控内容图像的样式。我们提出了具有对比性的语言视觉艺术家(CLVA),该视觉艺术家学习从样式指令中提取视觉语义,并通过补丁风格分析师完成\ textt{LDIST}。歧视者考虑了风格图像的语言和补丁或结果传输到联合嵌入样式指令之间的关联性。CLVA进一步比较了内容图像和风格教学的对比性配对,以提高传输结果之间的相对性。同一内容图像的传输结果可以维护一致的内容结构。此外,它们应该从含有类似视觉语义的样式指示中呈现相似的样式模式。实验显示,我们的CLVA是有效的,并且实现了在\texttLDIS}。

0

相关内容

contrastive

最新《图像描述Image Captioning》综述论文，22页pdf220篇文献

专知会员服务

43+阅读 · 2021年7月17日

近期必读的5篇顶会CVPR 2021【图像/视频描述生成】相关论文和代码

专知会员服务

48+阅读 · 2021年4月25日

【CVPR2021】一种基于知识蒸馏的弱监督图像文本匹配模型

专知会员服务

35+阅读 · 2021年4月8日

【CVPR2021】基于相似性分布距离的无监督人脸图像质量评价

专知会员服务

32+阅读 · 2021年3月19日

Python图像处理，366页pdf，Image Operators Image Processing in Python

Python图像处理，366页pdf，Image Operators Image Processing in Python

专知会员服务

77+阅读 · 2020年7月23日

【芝加哥大学】可变形的风格转移，Deformable Style Transfer

【芝加哥大学】可变形的风格转移，Deformable Style Transfer

专知会员服务

31+阅读 · 2020年3月26日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【字节跳动&Adobe】图割多模态风格迁移，Multimodal Style Transfer via Graph Cuts

【字节跳动&Adobe】图割多模态风格迁移，Multimodal Style Transfer via Graph Cuts

专知会员服务

15+阅读 · 2020年1月9日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

条件GAN重大改进！cGANs with Projection Discriminator

条件GAN重大改进！cGANs with Projection Discriminator

CreateAMind

8+阅读 · 2018年2月7日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

【专知荟萃19】图像识别Image Recognition知识资料全集（入门/进阶/论文/综述/视频/专家，附查看）

【专知荟萃19】图像识别Image Recognition知识资料全集（入门/进阶/论文/综述/视频/专家，附查看）

专知

20+阅读 · 2017年11月18日

gan生成图像at 1024² 的代码论文

gan生成图像at 1024² 的代码论文

CreateAMind

4+阅读 · 2017年10月31日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

Image to Image Translation Using GAN - Part 2 | 每周话题精选 #06

Image to Image Translation Using GAN - Part 2 | 每周话题精选 #06

PaperWeekly

5+阅读 · 2017年7月19日

Diagonal Attention and Style-based GAN for Content-Style Disentanglement in Image Generation and Translation

Arxiv

0+阅读 · 2021年7月23日

Image-to-Image Translation with Low Resolution Conditioning

Arxiv

0+阅读 · 2021年7月23日

Query2Label: A Simple Transformer Way to Multi-Label Classification

Arxiv

3+阅读 · 2021年7月22日

Image-to-image Translation via Hierarchical Style Disentanglement

Arxiv

8+阅读 · 2021年3月2日

Unsupervised Domain Adaptation for Semantic Segmentation by Content Transfer

Arxiv

4+阅读 · 2020年12月23日

In-Domain GAN Inversion for Real Image Editing

Arxiv

3+阅读 · 2020年7月16日

Two-phase Hair Image Synthesis by Self-Enhancing Generative Model

Two-phase Hair Image Synthesis by Self-Enhancing Generative Model

Arxiv

3+阅读 · 2019年2月28日

Unsupervised Image Captioning

Arxiv

7+阅读 · 2018年11月27日

Building medical image classifiers with very limited data using segmentation networks

Arxiv

4+阅读 · 2018年8月15日

XGAN: Unsupervised Image-to-Image Translation for Many-to-Many Mappings

Arxiv

3+阅读 · 2018年4月25日

VIP会员

文章信息

相关主题

相关VIP内容

最新《图像描述Image Captioning》综述论文，22页pdf220篇文献

专知会员服务

43+阅读 · 2021年7月17日

近期必读的5篇顶会CVPR 2021【图像/视频描述生成】相关论文和代码

专知会员服务

48+阅读 · 2021年4月25日

【CVPR2021】一种基于知识蒸馏的弱监督图像文本匹配模型

专知会员服务

35+阅读 · 2021年4月8日

【CVPR2021】基于相似性分布距离的无监督人脸图像质量评价

专知会员服务

32+阅读 · 2021年3月19日

Python图像处理，366页pdf，Image Operators Image Processing in Python

Python图像处理，366页pdf，Image Operators Image Processing in Python

专知会员服务

77+阅读 · 2020年7月23日

【芝加哥大学】可变形的风格转移，Deformable Style Transfer

【芝加哥大学】可变形的风格转移，Deformable Style Transfer

专知会员服务

31+阅读 · 2020年3月26日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【字节跳动&Adobe】图割多模态风格迁移，Multimodal Style Transfer via Graph Cuts

【字节跳动&Adobe】图割多模态风格迁移，Multimodal Style Transfer via Graph Cuts

专知会员服务

15+阅读 · 2020年1月9日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

热门VIP内容

开通专知VIP会员享更多权益服务

《乌克兰无人机产业：志愿者与政策在构建新兴无人机产业中的协同作用》最新报告

《人工智能辅助决策中的数据可视化：系统性综述》

人工智能驱动弹药制造现代化：美国陆军转型之路

《敏捷作战部署中枢纽-辐条基地选址优化研究》80页

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

条件GAN重大改进！cGANs with Projection Discriminator

条件GAN重大改进！cGANs with Projection Discriminator

CreateAMind

8+阅读 · 2018年2月7日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

【专知荟萃19】图像识别Image Recognition知识资料全集（入门/进阶/论文/综述/视频/专家，附查看）

【专知荟萃19】图像识别Image Recognition知识资料全集（入门/进阶/论文/综述/视频/专家，附查看）

专知

20+阅读 · 2017年11月18日

gan生成图像at 1024² 的代码论文

gan生成图像at 1024² 的代码论文

CreateAMind

4+阅读 · 2017年10月31日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

Image to Image Translation Using GAN - Part 2 | 每周话题精选 #06

Image to Image Translation Using GAN - Part 2 | 每周话题精选 #06

PaperWeekly

5+阅读 · 2017年7月19日

相关论文

Diagonal Attention and Style-based GAN for Content-Style Disentanglement in Image Generation and Translation

Arxiv

0+阅读 · 2021年7月23日

Image-to-Image Translation with Low Resolution Conditioning

Arxiv

0+阅读 · 2021年7月23日

Query2Label: A Simple Transformer Way to Multi-Label Classification

Arxiv

3+阅读 · 2021年7月22日

Image-to-image Translation via Hierarchical Style Disentanglement

Arxiv

8+阅读 · 2021年3月2日

Unsupervised Domain Adaptation for Semantic Segmentation by Content Transfer

Arxiv

4+阅读 · 2020年12月23日

In-Domain GAN Inversion for Real Image Editing

Arxiv

3+阅读 · 2020年7月16日

Two-phase Hair Image Synthesis by Self-Enhancing Generative Model

Two-phase Hair Image Synthesis by Self-Enhancing Generative Model

Arxiv

3+阅读 · 2019年2月28日

Unsupervised Image Captioning

Arxiv

7+阅读 · 2018年11月27日

Building medical image classifiers with very limited data using segmentation networks

Arxiv

4+阅读 · 2018年8月15日

XGAN: Unsupervised Image-to-Image Translation for Many-to-Many Mappings

Arxiv

3+阅读 · 2018年4月25日

微信扫码咨询专知VIP会员