Snowformer: 单一图像脱下时通过上下文互动实现的有天分的变异器 (SnowFormer: Scale-aware Transformer via Context Interaction for Single Image Desnowing) - 专知论文

会员服务 ·

0

INTERACT · Projection · INFORMS · Extensibility · 变换 ·

2022 年 8 月 23 日

SnowFormer: Scale-aware Transformer via Context Interaction for Single Image Desnowing

翻译：Snowformer: 单一图像脱下时通过上下文互动实现的有天分的变异器

Sixiang Chen,Tian Ye,Yun Liu,Erkang Chen,Jun Shi,Jingchun Zhou

Single image desnowing is a common yet challenging task. The complex snow degradations and diverse degradation scales demand strong representation ability. In order for the desnowing network to see various snow degradations and model the context interaction of local details and global information, we propose a powerful architecture dubbed as SnowFormer. First, it performs Scale-aware Feature Aggregation in the encoder to capture rich snow information of various degradations. Second, in order to tackle with large-scale degradation, it uses a novel Context Interaction Transformer Block in the decoder, which conducts context interaction of local details and global information from previous scale-aware feature aggregation in global context interaction. And the introduction of local context interaction improves recovery of scene details. Third, we devise a Heterogeneous Feature Projection Head which progressively fuse features from both the encoder and decoder and project the refined feature into the clean image. Extensive experiments demonstrate that the proposed SnowFormer achieves significant improvements over other SOTA methods. Compared with SOTA single image desnowing method HDCW-Net, it boosts the PSNR metric by 9.2dB on the CSD testset. Moreover, it also achieves a 5.13dB increase in PSNR compared with general image restoration architecture NAFNet, which verifies the strong representation ability of our SnowFormer for snow removal task. The code is released in \url{https://github.com/Ephemeral182/SnowFormer}.

翻译：单一图像的淡化是一项共同但具有挑战性的任务。复杂的降雪退化和不同的降解规模要求强大的代表能力。为了让降雪网络看到各种降雪的退化,并模拟当地细节和全球信息的背景互动。为了让降雪网络看到各种降雪的退化和模拟当地细节和全球信息的背景互动,我们建议了一个称为“Snow Former”的强大架构。首先,它在编码器中安装了比例觉觉的特征聚合,以捕捉各种退化的丰富雪信息。第二,为了应对大规模退化,它使用一个新型的环境互动变异块在拆解器中,它使用一个新的环境变异器。在拆解器中,它进行地方细节和以前规模变异功能组合的全球信息的背景互动。为了让当地环境互动能够改善现场细节的恢复。第三,我们设计了一个超强的地变异功能投影头,把精细的功能与各种变形的变形图像投影集集集到Scial-Stual-Net上,它能提升了PSNRIS/Stual的图像变影化能力。

0

相关内容

INTERACT

IFIP TC13 Conference on Human-Computer Interaction是人机交互领域的研究者和实践者展示其工作的重要平台。多年来，这些会议吸引了来自几个国家和文化的研究人员。官网链接：http://interact2019.org/

【CVPR 2022】多模态视频字幕的端到端生成预训练，End-to-end Generative Pretraining for Multimodal Video Captioning

【CVPR 2022】多模态视频字幕的端到端生成预训练，End-to-end Generative Pretraining for Multimodal Video Captioning

专知会员服务

27+阅读 · 2022年3月3日

NeurIPS 2021教程|OpenAI-Lilian Weng等：自监督学习与对比学习，105页ppt，

NeurIPS 2021教程|OpenAI-Lilian Weng等：自监督学习与对比学习，105页ppt，

专知会员服务

78+阅读 · 2021年12月10日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

利用GPS观测资料反演高时空分辨率局部地表质量变化的方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

网络环境下基于视觉显著性的图像检索

国家自然科学基金

1+阅读 · 2014年12月31日

Alzheimer病脑内BDNF/TrkB信号时空特异性调控LIMK1蛋白并保护突触功能的机制及应用

国家自然科学基金

0+阅读 · 2013年12月31日

基于视觉皮层信息处理机制的行人检测与行为识别

国家自然科学基金

0+阅读 · 2013年12月31日

ATF3在前列腺癌雄激素非依赖性形成中的作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

抑制TGF-β1/Smad 信号通路促进骨-肌腱结合部瘢痕愈合后的软骨性重塑

国家自然科学基金

0+阅读 · 2011年12月31日

Reality-based Interaction用户界面模型和评估方法研究

国家自然科学基金

0+阅读 · 2011年12月31日

Erbin在细胞分裂周期中的作用

国家自然科学基金

0+阅读 · 2009年12月31日

基于Surfacelet多尺度积的三维SAR图像去噪与分割

国家自然科学基金

0+阅读 · 2009年12月31日

基于AlGaAs光波导产生超宽带(UWB)脉冲的新机理新技术研究

国家自然科学基金

0+阅读 · 2009年12月31日

Temporally Consistent Video Transformer for Long-Term Video Prediction

Arxiv

0+阅读 · 2022年10月5日

Dual-former: Hybrid Self-attention Transformer for Efficient Image Restoration

Arxiv

0+阅读 · 2022年10月3日

A Strong Transfer Baseline for RGB-D Fusion in Vision Transformers

Arxiv

0+阅读 · 2022年10月3日

Merging Classification Predictions with Sequential Information for Lightweight Visual Place Recognition in Changing Environments

Arxiv

0+阅读 · 2022年10月3日

Fully Transformer Network for Change Detection of Remote Sensing Images

Arxiv

1+阅读 · 2022年10月3日

Patch-Based Stochastic Attention for Image Editing

Arxiv

0+阅读 · 2022年9月30日

Learning to Estimate Shapley Values with Vision Transformers

Arxiv

0+阅读 · 2022年9月30日

Hierarchical Label-wise Attention Transformer Model for Explainable ICD Coding

Arxiv

0+阅读 · 2022年9月30日

Nested Hierarchical Transformer: Towards Accurate, Data-Efficient and Interpretable Visual Understanding

Arxiv

12+阅读 · 2021年12月30日

A Survey of Visual Transformers

Arxiv

39+阅读 · 2021年11月11日

VIP会员

文章信息

相关主题

相关VIP内容

【CVPR 2022】多模态视频字幕的端到端生成预训练，End-to-end Generative Pretraining for Multimodal Video Captioning

【CVPR 2022】多模态视频字幕的端到端生成预训练，End-to-end Generative Pretraining for Multimodal Video Captioning

专知会员服务

27+阅读 · 2022年3月3日

NeurIPS 2021教程|OpenAI-Lilian Weng等：自监督学习与对比学习，105页ppt，

NeurIPS 2021教程|OpenAI-Lilian Weng等：自监督学习与对比学习，105页ppt，

专知会员服务

78+阅读 · 2021年12月10日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

小规模训练指南：打造世界级大语言模型的关键方法

无人机编队飞行：复杂环境中作战的策略、挑战与应用

大模型APP，AI时代第一个爆款

从数据中心视角出发的高效大语言模型训练综述

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

相关论文

Temporally Consistent Video Transformer for Long-Term Video Prediction

Arxiv

0+阅读 · 2022年10月5日

Dual-former: Hybrid Self-attention Transformer for Efficient Image Restoration

Arxiv

0+阅读 · 2022年10月3日

A Strong Transfer Baseline for RGB-D Fusion in Vision Transformers

Arxiv

0+阅读 · 2022年10月3日

Merging Classification Predictions with Sequential Information for Lightweight Visual Place Recognition in Changing Environments

Arxiv

0+阅读 · 2022年10月3日

Fully Transformer Network for Change Detection of Remote Sensing Images

Arxiv

1+阅读 · 2022年10月3日

Patch-Based Stochastic Attention for Image Editing

Arxiv

0+阅读 · 2022年9月30日

Learning to Estimate Shapley Values with Vision Transformers

Arxiv

0+阅读 · 2022年9月30日

Hierarchical Label-wise Attention Transformer Model for Explainable ICD Coding

Arxiv

0+阅读 · 2022年9月30日

Nested Hierarchical Transformer: Towards Accurate, Data-Efficient and Interpretable Visual Understanding

Arxiv

12+阅读 · 2021年12月30日

A Survey of Visual Transformers

Arxiv

39+阅读 · 2021年11月11日

相关基金

利用GPS观测资料反演高时空分辨率局部地表质量变化的方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

网络环境下基于视觉显著性的图像检索

国家自然科学基金

1+阅读 · 2014年12月31日

Alzheimer病脑内BDNF/TrkB信号时空特异性调控LIMK1蛋白并保护突触功能的机制及应用

国家自然科学基金

0+阅读 · 2013年12月31日

基于视觉皮层信息处理机制的行人检测与行为识别

国家自然科学基金

0+阅读 · 2013年12月31日

ATF3在前列腺癌雄激素非依赖性形成中的作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

抑制TGF-β1/Smad 信号通路促进骨-肌腱结合部瘢痕愈合后的软骨性重塑

国家自然科学基金

0+阅读 · 2011年12月31日

Reality-based Interaction用户界面模型和评估方法研究

国家自然科学基金

0+阅读 · 2011年12月31日

Erbin在细胞分裂周期中的作用

国家自然科学基金

0+阅读 · 2009年12月31日

基于Surfacelet多尺度积的三维SAR图像去噪与分割

国家自然科学基金

0+阅读 · 2009年12月31日

基于AlGaAs光波导产生超宽带(UWB)脉冲的新机理新技术研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员