WaveFill: A Wavelet-based Generation Network for Image Inpainting - Zhuanzhi (专知) Papers


Image inpainting · Generator network · Extensibility · Networking · Decomposition

July 23, 2021

WaveFill: A Wavelet-based Generation Network for Image Inpainting


Yingchen Yu,Fangneng Zhan,Shijian Lu,Jianxiong Pan,Feiying Ma,Xuansong Xie,Chunyan Miao

from arxiv, 10 pages, 7 figures

Image inpainting aims to complete the missing or corrupted regions of images with realistic content. Prevalent approaches adopt a hybrid objective of reconstruction and perceptual quality by using generative adversarial networks. However, the reconstruction loss and the adversarial loss focus on synthesizing content of different frequencies, and simply applying them together often leads to inter-frequency conflicts and compromised inpainting. This paper presents WaveFill, a wavelet-based inpainting network that decomposes images into multiple frequency bands and fills the missing regions in each frequency band separately and explicitly. WaveFill decomposes images using the discrete wavelet transform (DWT), which preserves spatial information naturally. It applies an L1 reconstruction loss to the decomposed low-frequency bands and an adversarial loss to the high-frequency bands, thereby effectively mitigating inter-frequency conflicts while completing images in the spatial domain. To address the inpainting inconsistency across frequency bands and to fuse features with distinct statistics, we design a novel normalization scheme that aligns and fuses the multi-frequency features effectively. Extensive experiments over multiple datasets show that WaveFill achieves superior image inpainting both qualitatively and quantitatively.
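To make the decomposition step in the abstract concrete, here is a minimal NumPy sketch of a single-level 2D Haar DWT and its inverse. It is only an illustration under stated assumptions: the abstract does not say which wavelet or how many decomposition levels WaveFill uses, so the Haar filter, the function names `haar_dwt2`, `haar_idwt2` and `l1_loss`, and the band labels are hypothetical choices made here, not the authors' implementation. The sketch shows two properties the abstract relies on: each sub-band is a half-resolution map that keeps the spatial layout of the image, and the transform is invertible, so bands filled separately can be merged back into the spatial domain. The adversarial loss on the high-frequency bands is left out because it needs a trained discriminator.

```python
import numpy as np


def haar_dwt2(img):
    """Single-level 2D Haar DWT of an (H, W) array with even H and W.

    Returns the low-frequency band and a tuple of three high-frequency
    bands, each of shape (H/2, W/2). Band naming conventions vary between
    libraries; the labels below are an assumption of this sketch.
    """
    img = np.asarray(img, dtype=np.float64)
    a = img[0::2, 0::2]  # top-left pixel of every 2x2 block
    b = img[0::2, 1::2]  # top-right pixel
    c = img[1::2, 0::2]  # bottom-left pixel
    d = img[1::2, 1::2]  # bottom-right pixel
    ll = (a + b + c + d) / 2.0  # local average: low-frequency content
    lh = (a + b - c - d) / 2.0  # difference between the two rows
    hl = (a - b + c - d) / 2.0  # difference between the two columns
    hh = (a - b - c + d) / 2.0  # diagonal difference
    return ll, (lh, hl, hh)


def haar_idwt2(ll, highs):
    """Inverse of haar_dwt2: rebuild the (H, W) image from its four bands."""
    lh, hl, hh = highs
    h, w = ll.shape
    out = np.empty((2 * h, 2 * w), dtype=np.float64)
    out[0::2, 0::2] = (ll + lh + hl + hh) / 2.0
    out[0::2, 1::2] = (ll + lh - hl - hh) / 2.0
    out[1::2, 0::2] = (ll - lh + hl - hh) / 2.0
    out[1::2, 1::2] = (ll - lh - hl + hh) / 2.0
    return out


def l1_loss(pred, target):
    """Mean absolute error, the kind of loss the abstract applies to low-frequency bands."""
    return float(np.mean(np.abs(pred - target)))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    image = rng.random((8, 8))
    low, highs = haar_dwt2(image)
    restored = haar_idwt2(low, highs)
    print("low-frequency band shape:", low.shape)
    print("perfect reconstruction:", np.allclose(restored, image))
```

In a training setup along the lines the abstract describes, `l1_loss` would compare the predicted and ground-truth low-frequency bands, while the three high-frequency bands would be scored by a discriminator instead.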



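Related content

Image inpainting

Image inpainting is the process of reconstructing lost or damaged parts of images and videos. In museums, for example, this work is usually carried out by experienced conservators or art restorers. In the digital world, image inpainting, also called image interpolation or video interpolation, uses sophisticated algorithms to replace lost or corrupted image data, mainly small regions and blemishes.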

