StyleCLIP: StyleGAN 图像的文本驱动操作 (StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery) - 专知论文

会员服务 ·

0

StyleGAN · Prompt · Extensibility · 潜在 · INTERACT ·

2021 年 3 月 31 日

StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery

翻译：StyleCLIP: StyleGAN 图像的文本驱动操作

Or Patashnik,Zongze Wu,Eli Shechtman,Daniel Cohen-Or,Dani Lischinski

from arxiv, 18 pages, 24 figures, code and video may be found here: https://github.com/orpatashnik/StyleCLIP

Inspired by the ability of StyleGAN to generate highly realistic images in a variety of domains, much recent work has focused on understanding how to use the latent spaces of StyleGAN to manipulate generated and real images. However, discovering semantically meaningful latent manipulations typically involves painstaking human examination of the many degrees of freedom, or an annotated collection of images for each desired manipulation. In this work, we explore leveraging the power of recently introduced Contrastive Language-Image Pre-training (CLIP) models in order to develop a text-based interface for StyleGAN image manipulation that does not require such manual effort. We first introduce an optimization scheme that utilizes a CLIP-based loss to modify an input latent vector in response to a user-provided text prompt. Next, we describe a latent mapper that infers a text-guided latent manipulation step for a given input image, allowing faster and more stable text-based manipulation. Finally, we present a method for mapping a text prompts to input-agnostic directions in StyleGAN's style space, enabling interactive text-driven image manipulation. Extensive results and comparisons demonstrate the effectiveness of our approaches.

翻译：受StyleGAN在不同领域生成高度现实图像的能力的启发,最近许多工作侧重于了解如何利用StyleGAN的潜在空间来操纵生成的图像和实际图像。然而,发现具有内在意义的潜在操纵通常需要人类对自由的多种程度进行艰苦检查,或为每个想要的操作收集附加注释的图像。在这项工作中,我们探索如何利用最近引入的对抗性语言图像培训前模型的力量,以便为StyleGAN图像操作开发一个基于文本的界面,而不需要这种手工操作。我们首先引入一个优化方案,利用基于 CLIP 的损失来修改输入的潜在矢量,以响应用户提供的文本提示。接下来,我们描述一个潜在的映像器,为某种输入图像提供文本指导的潜在操纵步骤,允许更快和更稳定的文本操纵。最后,我们提出了一个方法,用于绘制文本提示到StyleGAN风格空间的输入-不可知性方向,从而能够进行交互式文本驱动图像操纵。我们的广泛结果和比较展示了我们方法的有效性。

0

相关内容

StyleGAN

基于深度学习的图像实例分割技术研究进展

专知会员服务

63+阅读 · 2021年3月6日

生成对抗网络GAN在各领域应用研究进展(中文版)，37页pdf

生成对抗网络GAN在各领域应用研究进展(中文版)，37页pdf

专知会员服务

151+阅读 · 2020年12月30日

自然语言处理中的注意力机制，Attention in Natural Language Processing

自然语言处理中的注意力机制，Attention in Natural Language Processing

专知会员服务

136+阅读 · 2020年5月30日

深度学习图像分割综述论文最新版，Image Segmentation Using Deep Learning: A Survey

深度学习图像分割综述论文最新版，Image Segmentation Using Deep Learning: A Survey

专知会员服务

93+阅读 · 2020年4月11日

【论文推荐】不同图像域弱监督语义分割的综合分析，A Comprehensive Analysis of Weakly-Supervised Semantic Segmentation in Different Image Domains

【论文推荐】不同图像域弱监督语义分割的综合分析，A Comprehensive Analysis of Weakly-Supervised Semantic Segmentation in Different Image Domains

专知会员服务

28+阅读 · 2019年12月27日

【医学图像分割| 2019新综述】生物医学图像分割的机器学习技术：技术方面综述和最新应用介绍（Machine Learning Techniques for Biomedical Image Segmentation: An Overview of Technical Aspects and Introduction to State-of-Art Applications），附35页PDF

【医学图像分割| 2019新综述】生物医学图像分割的机器学习技术：技术方面综述和最新应用介绍（Machine Learning Techniques for Biomedical Image Segmentation: An Overview of Technical Aspects and Introduction to State-of-Art Applications），附35页PDF

专知会员服务

57+阅读 · 2019年11月23日

【图像分割| 2019最新综述】自然图像和医学图像的深层语义分割，附21页PDF（Deep Semantic Segmentation of Natural and Medical Images: A Review）

【图像分割| 2019最新综述】自然图像和医学图像的深层语义分割，附21页PDF（Deep Semantic Segmentation of Natural and Medical Images: A Review）

专知会员服务

54+阅读 · 2019年11月16日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

人脸合成效果媲美StyleGAN，而它是个自编码器

人脸合成效果媲美StyleGAN，而它是个自编码器

机器之心

3+阅读 · 2020年4月26日

来，用StyleGAN给自己捏一个对象

来，用StyleGAN给自己捏一个对象

QCon

3+阅读 · 2019年2月21日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

CycleGAN：图片风格，想换就换 | ICCV 2017论文解读

CycleGAN：图片风格，想换就换 | ICCV 2017论文解读

PaperWeekly

12+阅读 · 2018年3月14日

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

专知

27+阅读 · 2018年2月7日

ICCV17 :12为顶级大牛教你学生成对抗网络（GAN)！

ICCV17 :12为顶级大牛教你学生成对抗网络（GAN)！

全球人工智能

8+阅读 · 2017年11月26日

gan生成图像at 1024² 的代码论文

gan生成图像at 1024² 的代码论文

CreateAMind

4+阅读 · 2017年10月31日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

Exploiting Diverse Characteristics and Adversarial Ambivalence for Domain Adaptive Segmentation

Exploiting Diverse Characteristics and Adversarial Ambivalence for Domain Adaptive Segmentation

Arxiv

9+阅读 · 2020年12月10日

In-Domain GAN Inversion for Real Image Editing

Arxiv

3+阅读 · 2020年7月16日

Data augmentation using learned transforms for one-shot medical image segmentation

Arxiv

5+阅读 · 2019年2月25日

Red blood cell image generation for data augmentation using Conditional Generative Adversarial Networks

Red blood cell image generation for data augmentation using Conditional Generative Adversarial Networks

Arxiv

4+阅读 · 2019年1月18日

High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs

High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs

Arxiv

3+阅读 · 2018年8月20日

Convexity Shape Prior for Level Set based Image Segmentation Method

Arxiv

4+阅读 · 2018年5月22日

Mask-aware Photorealistic Face Attribute Manipulation

Arxiv

5+阅读 · 2018年4月24日

Image Segmentation Using Subspace Representation and Sparse Decomposition

Arxiv

6+阅读 · 2018年4月6日

Adaptive strategy for superpixel-based region-growing image segmentation

Arxiv

4+阅读 · 2018年3月17日

Adversarial Attribute-Image Person Re-identification

Arxiv

7+阅读 · 2018年2月6日

VIP会员

文章信息

相关主题

相关VIP内容

基于深度学习的图像实例分割技术研究进展

专知会员服务

63+阅读 · 2021年3月6日

生成对抗网络GAN在各领域应用研究进展(中文版)，37页pdf

生成对抗网络GAN在各领域应用研究进展(中文版)，37页pdf

专知会员服务

151+阅读 · 2020年12月30日

自然语言处理中的注意力机制，Attention in Natural Language Processing

自然语言处理中的注意力机制，Attention in Natural Language Processing

专知会员服务

136+阅读 · 2020年5月30日

深度学习图像分割综述论文最新版，Image Segmentation Using Deep Learning: A Survey

深度学习图像分割综述论文最新版，Image Segmentation Using Deep Learning: A Survey

专知会员服务

93+阅读 · 2020年4月11日

【论文推荐】不同图像域弱监督语义分割的综合分析，A Comprehensive Analysis of Weakly-Supervised Semantic Segmentation in Different Image Domains

【论文推荐】不同图像域弱监督语义分割的综合分析，A Comprehensive Analysis of Weakly-Supervised Semantic Segmentation in Different Image Domains

专知会员服务

28+阅读 · 2019年12月27日

【医学图像分割| 2019新综述】生物医学图像分割的机器学习技术：技术方面综述和最新应用介绍（Machine Learning Techniques for Biomedical Image Segmentation: An Overview of Technical Aspects and Introduction to State-of-Art Applications），附35页PDF

【医学图像分割| 2019新综述】生物医学图像分割的机器学习技术：技术方面综述和最新应用介绍（Machine Learning Techniques for Biomedical Image Segmentation: An Overview of Technical Aspects and Introduction to State-of-Art Applications），附35页PDF

专知会员服务

57+阅读 · 2019年11月23日

【图像分割| 2019最新综述】自然图像和医学图像的深层语义分割，附21页PDF（Deep Semantic Segmentation of Natural and Medical Images: A Review）

【图像分割| 2019最新综述】自然图像和医学图像的深层语义分割，附21页PDF（Deep Semantic Segmentation of Natural and Medical Images: A Review）

专知会员服务

54+阅读 · 2019年11月16日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

NeurIPS 2025 | 自动化所新作速览（一）

大型语言模型（LLM）赋能的知识图谱构建：综述

NeurIPS 2025 | 自动化所新作速览（二）

领域特定文本分类中的预训练语言模型新进展：系统综述

相关资讯

人脸合成效果媲美StyleGAN，而它是个自编码器

人脸合成效果媲美StyleGAN，而它是个自编码器

机器之心

3+阅读 · 2020年4月26日

来，用StyleGAN给自己捏一个对象

来，用StyleGAN给自己捏一个对象

QCon

3+阅读 · 2019年2月21日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

CycleGAN：图片风格，想换就换 | ICCV 2017论文解读

CycleGAN：图片风格，想换就换 | ICCV 2017论文解读

PaperWeekly

12+阅读 · 2018年3月14日

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

专知

27+阅读 · 2018年2月7日

ICCV17 :12为顶级大牛教你学生成对抗网络（GAN)！

ICCV17 :12为顶级大牛教你学生成对抗网络（GAN)！

全球人工智能

8+阅读 · 2017年11月26日

gan生成图像at 1024² 的代码论文

gan生成图像at 1024² 的代码论文

CreateAMind

4+阅读 · 2017年10月31日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

相关论文

Exploiting Diverse Characteristics and Adversarial Ambivalence for Domain Adaptive Segmentation

Exploiting Diverse Characteristics and Adversarial Ambivalence for Domain Adaptive Segmentation

Arxiv

9+阅读 · 2020年12月10日

In-Domain GAN Inversion for Real Image Editing

Arxiv

3+阅读 · 2020年7月16日

Data augmentation using learned transforms for one-shot medical image segmentation

Arxiv

5+阅读 · 2019年2月25日

Red blood cell image generation for data augmentation using Conditional Generative Adversarial Networks

Red blood cell image generation for data augmentation using Conditional Generative Adversarial Networks

Arxiv

4+阅读 · 2019年1月18日

High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs

High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs

Arxiv

3+阅读 · 2018年8月20日

Convexity Shape Prior for Level Set based Image Segmentation Method

Arxiv

4+阅读 · 2018年5月22日

Mask-aware Photorealistic Face Attribute Manipulation

Arxiv

5+阅读 · 2018年4月24日

Image Segmentation Using Subspace Representation and Sparse Decomposition

Arxiv

6+阅读 · 2018年4月6日

Adaptive strategy for superpixel-based region-growing image segmentation

Arxiv

4+阅读 · 2018年3月17日

Adversarial Attribute-Image Person Re-identification

Arxiv

7+阅读 · 2018年2月6日

微信扫码咨询专知VIP会员