ChatFace: Chat-Guided Real Face Editing via Diffusion Latent Space Manipulation

June 5, 2023

Dongxu Yue, Qin Guo, Munan Ning, Jiaxi Cui, Yuesheng Zhu, Li Yuan

Editing real facial images is a crucial task in computer vision with significant demand in various real-world applications. While GAN-based methods have shown potential in manipulating images, especially when combined with CLIP, they are limited in their ability to reconstruct real images because GAN inversion remains challenging. Although diffusion-based methods achieve successful image reconstruction, effectively manipulating fine-grained facial attributes with textual instructions is still difficult. To address these issues and facilitate convenient manipulation of real facial images, we propose a novel approach that conducts text-driven image editing in the semantic latent space of a diffusion model. By aligning the temporal feature of the diffusion model with the semantic condition during the generative process, we introduce a stable manipulation strategy that performs precise zero-shot manipulation effectively. Furthermore, we develop an interactive system named ChatFace, which leverages the zero-shot reasoning ability of large language models to perform efficient manipulations in the diffusion semantic latent space. This system enables users to perform complex multi-attribute manipulations through dialogue, opening up new possibilities for interactive image editing. Extensive experiments confirm that our approach outperforms previous methods and enables precise editing of real facial images, making it a promising candidate for real-world applications. Project page: https://dongxuyue.github.io/chatface/
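To make the idea of multi-attribute manipulation in a semantic latent space concrete, here is a minimal sketch. It is not the authors' implementation: it assumes, purely for illustration, that each attribute corresponds to a fixed direction vector and that an edit is a linear shift of the latent code. The names `apply_edits`, `z_sem`, `directions`, and `strengths` are hypothetical, and the abstract does not specify how the edit is actually computed.

```python
from typing import Dict, List

Vector = List[float]


def apply_edits(
    z_sem: Vector,
    directions: Dict[str, Vector],
    strengths: Dict[str, float],
) -> Vector:
    """Shift a semantic latent code along one direction per requested attribute.

    Illustrative assumption: an edit is z + strength * direction. A real system
    might predict the shift with a learned network instead.
    """
    edited = list(z_sem)  # copy so the caller's latent code is not mutated
    for name, strength in strengths.items():
        direction = directions[name]
        if len(direction) != len(edited):
            raise ValueError(f"direction '{name}' has the wrong dimensionality")
        edited = [z + strength * d for z, d in zip(edited, direction)]
    return edited


# Toy 2-D latent code with two hypothetical attribute directions.
toy_directions = {"smile": [1.0, 0.0], "age": [0.0, 2.0]}
toy_result = apply_edits([0.0, 1.0], toy_directions, {"smile": 0.5, "age": -1.0})
```

In a dialogue-driven system such as the one the abstract describes, the per-attribute strengths would presumably be derived from the user's request by the language model. That step is omitted here because the abstract gives no detail about it.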
