由培训前艺术家创作的端到端视觉编辑 (End-to-End Visual Editing with a Generatively Pre-Trained Artist) - 专知论文

会员服务 ·

0

端到端 · Extensibility · 条件概率分布 · 学成 · 目标领域 ·

2022 年 5 月 3 日

End-to-End Visual Editing with a Generatively Pre-Trained Artist

翻译：由培训前艺术家创作的端到端视觉编辑

Andrew Brown,Cheng-Yang Fu,Omkar Parkhi,Tamara L. Berg,Andrea Vedaldi

We consider the targeted image editing problem: blending a region in a source image with a driver image that specifies the desired change. Differently from prior works, we solve this problem by learning a conditional probability distribution of the edits, end-to-end. Training such a model requires addressing a fundamental technical challenge: the lack of example edits for training. To this end, we propose a self-supervised approach that simulates edits by augmenting off-the-shelf images in a target domain. The benefits are remarkable: implemented as a state-of-the-art auto-regressive transformer, our approach is simple, sidesteps difficulties with previous methods based on GAN-like priors, obtains significantly better edits, and is efficient. Furthermore, we show that different blending effects can be learned by an intuitive control of the augmentation process, with no other changes required to the model architecture. We demonstrate the superiority of this approach across several datasets in extensive quantitative and qualitative experiments, including human studies, significantly outperforming prior work.

翻译：我们考虑了有针对性的图像编辑问题: 将一个区域与源图像混在一起, 驱动图像可以指定想要的改变。与先前的工程不同, 我们通过学习编辑、端到端的有条件概率分布来解决这个问题。培训这样的模型需要解决一个根本性的技术挑战: 缺乏用于培训的范例编辑。为此, 我们提议了一种自我监督的方法, 通过在目标域内增加现成图像来模拟编辑。其好处是显著的: 我们的方法很简单, 与以前基于 GAN 类前科的旧方法相冲突, 获得大大改进的编辑, 并且效率很高。此外, 我们显示, 可以通过对增强过程的直观控制来学习不同的混合效应, 而不需要对模型结构作其他修改。我们通过广泛的定量和定性实验, 包括人类研究, 显著超过先前的工作, 展示了这个方法在多个数据集中的优势。

0

相关内容

端到端

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新七篇图像分割相关论文—Attention U-Net、对抗结构匹配损失、卷积CRFs、对抗样本、弱监督分割

【论文推荐】最新七篇图像分割相关论文—Attention U-Net、对抗结构匹配损失、卷积CRFs、对抗样本、弱监督分割

专知

19+阅读 · 2018年5月31日

干扰蛋白酶活化受体2 （PAR2）信号诱导肿瘤干细胞凋亡的分子机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于A-Train卫星观测的沙尘暴数字重构技术研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于BOTDR技术的岩溶塌陷监测预警试验研究

国家自然科学基金

0+阅读 · 2013年12月31日

ERBB2/ERBB3在PCOS大鼠卵巢胰岛素抵抗发生中的作用及其机制研究

国家自然科学基金

1+阅读 · 2012年12月31日

基于强制振荡下两相流传热的内冷活塞热状态分析与控制

国家自然科学基金

0+阅读 · 2012年12月31日

AlSiC电子封装复合材料磨削用钎焊金刚石微刃砂轮的研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于频域的空心砌块通风墙体动态传热模型及实验研究

国家自然科学基金

0+阅读 · 2012年12月31日

hsBAFF上调胞内钙离子激活B淋巴细胞的信号转导网络免疫调控机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

Ter94在Hedgehog信号转导途径中的作用机理

国家自然科学基金

0+阅读 · 2009年12月31日

基于几何约束lifting技术的细分小波变换研究

国家自然科学基金

0+阅读 · 2009年12月31日

Toward Unpaired Multi-modal Medical Image Segmentation via Learning Structured Semantic Consistency

Arxiv

0+阅读 · 2022年6月21日

Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data

Arxiv

0+阅读 · 2022年6月20日

Learning Multiscale Transformer Models for Sequence Generation

Arxiv

0+阅读 · 2022年6月19日

Conditional GANs with Auxiliary Discriminative Classifier

Conditional GANs with Auxiliary Discriminative Classifier

Arxiv

0+阅读 · 2022年6月17日

Self-Supervised Contrastive Pre-Training For Time Series via Time-Frequency Consistency

Arxiv

0+阅读 · 2022年6月17日

Visual Attention Methods in Deep Learning: An In-Depth Survey

Arxiv

44+阅读 · 2022年4月16日

Phase-aware Speech Enhancement with Deep Complex U-Net

Phase-aware Speech Enhancement with Deep Complex U-Net

Arxiv

15+阅读 · 2019年3月7日

Attention U-Net: Learning Where to Look for the Pancreas

Arxiv

17+阅读 · 2018年5月20日

End-to-End Multi-Task Learning with Attention

Arxiv

19+阅读 · 2018年3月28日

Generative Adversarial Networks and Probabilistic Graph Models for Hyperspectral Image Classification

Arxiv

11+阅读 · 2018年2月10日

VIP会员

文章信息

相关主题

条件概率分布

相关VIP内容

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【CMU博士论文】数据驱动决策中的激励、信息与不确定性

DGP双粒度提示框架：图增强大模型助力欺诈检测

【ICCV2025】ESSENTIAL：用于视频类增量学习的情景记忆与语义记忆整合

唯快不破：大型语言模型高效架构综述

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新七篇图像分割相关论文—Attention U-Net、对抗结构匹配损失、卷积CRFs、对抗样本、弱监督分割

【论文推荐】最新七篇图像分割相关论文—Attention U-Net、对抗结构匹配损失、卷积CRFs、对抗样本、弱监督分割

专知

19+阅读 · 2018年5月31日

相关论文

Toward Unpaired Multi-modal Medical Image Segmentation via Learning Structured Semantic Consistency

Arxiv

0+阅读 · 2022年6月21日

Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data

Arxiv

0+阅读 · 2022年6月20日

Learning Multiscale Transformer Models for Sequence Generation

Arxiv

0+阅读 · 2022年6月19日

Conditional GANs with Auxiliary Discriminative Classifier

Conditional GANs with Auxiliary Discriminative Classifier

Arxiv

0+阅读 · 2022年6月17日

Self-Supervised Contrastive Pre-Training For Time Series via Time-Frequency Consistency

Arxiv

0+阅读 · 2022年6月17日

Visual Attention Methods in Deep Learning: An In-Depth Survey

Arxiv

44+阅读 · 2022年4月16日

Phase-aware Speech Enhancement with Deep Complex U-Net

Phase-aware Speech Enhancement with Deep Complex U-Net

Arxiv

15+阅读 · 2019年3月7日

Attention U-Net: Learning Where to Look for the Pancreas

Arxiv

17+阅读 · 2018年5月20日

End-to-End Multi-Task Learning with Attention

Arxiv

19+阅读 · 2018年3月28日

Generative Adversarial Networks and Probabilistic Graph Models for Hyperspectral Image Classification

Arxiv

11+阅读 · 2018年2月10日

相关基金

干扰蛋白酶活化受体2 （PAR2）信号诱导肿瘤干细胞凋亡的分子机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于A-Train卫星观测的沙尘暴数字重构技术研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于BOTDR技术的岩溶塌陷监测预警试验研究

国家自然科学基金

0+阅读 · 2013年12月31日

ERBB2/ERBB3在PCOS大鼠卵巢胰岛素抵抗发生中的作用及其机制研究

国家自然科学基金

1+阅读 · 2012年12月31日

基于强制振荡下两相流传热的内冷活塞热状态分析与控制

国家自然科学基金

0+阅读 · 2012年12月31日

AlSiC电子封装复合材料磨削用钎焊金刚石微刃砂轮的研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于频域的空心砌块通风墙体动态传热模型及实验研究

国家自然科学基金

0+阅读 · 2012年12月31日

hsBAFF上调胞内钙离子激活B淋巴细胞的信号转导网络免疫调控机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

Ter94在Hedgehog信号转导途径中的作用机理

国家自然科学基金

0+阅读 · 2009年12月31日

基于几何约束lifting技术的细分小波变换研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员