轻量级图像超分辨率的全息聚合网络 (Omni Aggregation Networks for Lightweight Image Super-Resolution) - 专知论文

会员服务 ·

0

超分辨率 · 图像超分辨率 · 超分 · 图像超分 · SR ·

2023 年 4 月 20 日

Omni Aggregation Networks for Lightweight Image Super-Resolution

翻译：轻量级图像超分辨率的全息聚合网络

Hang Wang,Xuanhong Chen,Bingbing Ni,Yutian Liu,Jinfan Liu

from arxiv, Accepted by CVPR2023. Code is available at \url{https://github.com/Francis0625/Omni-SR}

While lightweight ViT framework has made tremendous progress in image super-resolution, its uni-dimensional self-attention modeling, as well as homogeneous aggregation scheme, limit its effective receptive field (ERF) to include more comprehensive interactions from both spatial and channel dimensions. To tackle these drawbacks, this work proposes two enhanced components under a new Omni-SR architecture. First, an Omni Self-Attention (OSA) block is proposed based on dense interaction principle, which can simultaneously model pixel-interaction from both spatial and channel dimensions, mining the potential correlations across omni-axis (i.e., spatial and channel). Coupling with mainstream window partitioning strategies, OSA can achieve superior performance with compelling computational budgets. Second, a multi-scale interaction scheme is proposed to mitigate sub-optimal ERF (i.e., premature saturation) in shallow models, which facilitates local propagation and meso-/global-scale interactions, rendering an omni-scale aggregation building block. Extensive experiments demonstrate that Omni-SR achieves record-high performance on lightweight super-resolution benchmarks (e.g., 26.95 dB@Urban100 $\times 4$ with only 792K parameters). Our code is available at \url{https://github.com/Francis0625/Omni-SR}.

翻译：虽然轻量级ViT框架在图像超分辨率方面取得了巨大进展，但其一维自注意建模以及同质聚合方案限制了其有效感受野（ERF）以包括更全面的空间和通道维度的交互。针对这些缺点，本文在新的Omni-SR架构下提出了两个增强组件。首先，基于密集交互原则提出了一个Omni Self-Attention (OSA)模块，它可以同时从空间和通道维度模型化像素-交互，挖掘来自全息轴（即空间和通道）的潜在相关性。结合主流的窗口分割策略，OSA可以在有吸引力的计算预算下实现优越的性能。其次，提出了多尺度交互方案，以缓解浅层模型中的子优感受野（即过早饱和），这有助于本地传播和中位/全局尺度的交互，形成全息尺度聚合构建块。广泛的实验表明，Omni-SR在轻量级超分辨率基准测试上取得了最高性能记录（例如，具有仅792K参数的Urban100 $\times 4$的26.95 dB）。我们的代码可在\url{https://github.com/Francis0625/Omni-SR}上获得。

0

相关内容

超分辨率

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

104+阅读 · 2022年2月10日

图像分割二十年，盘点影响力最大的10篇论文

图像分割二十年，盘点影响力最大的10篇论文

专知会员服务

45+阅读 · 2022年2月7日

NeurIPS 2021 | 又一超强视觉Transformer主干！HRFormer：学习高分辨率表征

NeurIPS 2021 | 又一超强视觉Transformer主干！HRFormer：学习高分辨率表征

专知会员服务

18+阅读 · 2021年12月8日

【CVPR2021】重新思考BiSeNet让语义分割模型速度起飞

【CVPR2021】重新思考BiSeNet让语义分割模型速度起飞

专知会员服务

34+阅读 · 2021年5月5日

【CVPR2020】用于图像超分辨率的深度展开网络，Deep Unfolding Network for Image Super-Resolution

【CVPR2020】用于图像超分辨率的深度展开网络，Deep Unfolding Network for Image Super-Resolution

专知会员服务

44+阅读 · 2020年3月26日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

GNN 新基准！Long Range Graph Benchmark

GNN 新基准！Long Range Graph Benchmark

图与推荐

0+阅读 · 2022年10月18日

征稿 | International Joint Conference on Knowledge Graphs (IJCKG)

征稿 | International Joint Conference on Knowledge Graphs (IJCKG)

开放知识图谱

2+阅读 · 2022年5月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Deep Compression/Acceleration：模型压缩加速论文汇总

Deep Compression/Acceleration：模型压缩加速论文汇总

极市平台

14+阅读 · 2019年5月15日

本周精选共读论文《计算机视觉图像分割》六篇

本周精选共读论文《计算机视觉图像分割》六篇

人工智能前沿讲习班

10+阅读 · 2019年4月1日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

语义分割+视频分割开源代码集合

语义分割+视频分割开源代码集合

极市平台

35+阅读 · 2018年3月5日

【推荐】全卷积语义分割综述

【推荐】全卷积语义分割综述

机器学习研究会

19+阅读 · 2017年8月31日

金属碳化物基低铂介孔催化材料的合成、界面设计与电催化性能研究

国家自然科学基金

0+阅读 · 2015年12月31日

3D多孔结构LiMnPO4•LiVPO4F@石墨烯气凝胶复合物材料的构筑及电化学性能研究

国家自然科学基金

0+阅读 · 2015年12月31日

多分辨率相机及图像超分辨率技术研究

国家自然科学基金

2+阅读 · 2014年12月31日

面向无线传感器网络的分布式压缩感知测量矩阵的优化设计

国家自然科学基金

1+阅读 · 2013年12月31日

基于二维导电网络的LiMnPO4体系@石墨烯材料的功能调控及其储锂特性

国家自然科学基金

0+阅读 · 2013年12月31日

“异维结构”光电器件的设计、制备及性能研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于SHMS的大跨悬索桥强/台风作用及其效应的精细化研究

国家自然科学基金

0+阅读 · 2009年12月31日

具有低温有序的[FePt/Au]10垂直磁记录介质材料的制备与表征

国家自然科学基金

0+阅读 · 2009年12月31日

基于动力学分析的Internet网络拥塞控制研究

国家自然科学基金

0+阅读 · 2009年12月31日

基于碳纳米管/石墨烯自组装复合结构的透明导电薄膜的研究

国家自然科学基金

0+阅读 · 2009年12月31日

Instructive Feature Enhancement for Dichotomous Medical Image Segmentation

Arxiv

0+阅读 · 2023年6月6日

MetaGait: Learning to Learn an Omni Sample Adaptive Representation for Gait Recognition

Arxiv

0+阅读 · 2023年6月6日

DFormer: Diffusion-guided Transformer for Universal Image Segmentation

Arxiv

1+阅读 · 2023年6月6日

NNSplitter: An Active Defense Solution for DNN Model via Automated Weight Obfuscation

Arxiv

0+阅读 · 2023年6月3日

A Feature Reuse Framework with Texture-adaptive Aggregation for Reference-based Super-Resolution

Arxiv

0+阅读 · 2023年6月2日

Transformers in Time Series: A Survey

Arxiv

34+阅读 · 2022年2月15日

MVFNet: Multi-View Fusion Network for Efficient Video Recognition

Arxiv

13+阅读 · 2021年1月5日

Hierarchical Graph Pooling with Structure Learning

Arxiv

13+阅读 · 2019年11月14日

Unsupervised Cross-Modality Domain Adaptation of ConvNets for Biomedical Image Segmentations with Adversarial Loss

Arxiv

10+阅读 · 2018年4月29日

An application of cascaded 3D fully convolutional networks for medical image segmentation

Arxiv

10+阅读 · 2018年3月20日

VIP会员

文章信息

相关主题

图像超分辨率

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

104+阅读 · 2022年2月10日

图像分割二十年，盘点影响力最大的10篇论文

图像分割二十年，盘点影响力最大的10篇论文

专知会员服务

45+阅读 · 2022年2月7日

NeurIPS 2021 | 又一超强视觉Transformer主干！HRFormer：学习高分辨率表征

NeurIPS 2021 | 又一超强视觉Transformer主干！HRFormer：学习高分辨率表征

专知会员服务

18+阅读 · 2021年12月8日

【CVPR2021】重新思考BiSeNet让语义分割模型速度起飞

【CVPR2021】重新思考BiSeNet让语义分割模型速度起飞

专知会员服务

34+阅读 · 2021年5月5日

【CVPR2020】用于图像超分辨率的深度展开网络，Deep Unfolding Network for Image Super-Resolution

【CVPR2020】用于图像超分辨率的深度展开网络，Deep Unfolding Network for Image Super-Resolution

专知会员服务

44+阅读 · 2020年3月26日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

《复杂工程系统模型驱动设计决策支持系统：早期设计阶段挑战》最新138页

《日本陆上自卫队2040年作战方式与未来作战研究》最新23页slides

人工智能作为战争武器

《后勤保障》最新23页

相关资讯

GNN 新基准！Long Range Graph Benchmark

GNN 新基准！Long Range Graph Benchmark

图与推荐

0+阅读 · 2022年10月18日

征稿 | International Joint Conference on Knowledge Graphs (IJCKG)

征稿 | International Joint Conference on Knowledge Graphs (IJCKG)

开放知识图谱

2+阅读 · 2022年5月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Deep Compression/Acceleration：模型压缩加速论文汇总

Deep Compression/Acceleration：模型压缩加速论文汇总

极市平台

14+阅读 · 2019年5月15日

本周精选共读论文《计算机视觉图像分割》六篇

本周精选共读论文《计算机视觉图像分割》六篇

人工智能前沿讲习班

10+阅读 · 2019年4月1日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

语义分割+视频分割开源代码集合

语义分割+视频分割开源代码集合

极市平台

35+阅读 · 2018年3月5日

【推荐】全卷积语义分割综述

【推荐】全卷积语义分割综述

机器学习研究会

19+阅读 · 2017年8月31日

相关论文

Instructive Feature Enhancement for Dichotomous Medical Image Segmentation

Arxiv

0+阅读 · 2023年6月6日

MetaGait: Learning to Learn an Omni Sample Adaptive Representation for Gait Recognition

Arxiv

0+阅读 · 2023年6月6日

DFormer: Diffusion-guided Transformer for Universal Image Segmentation

Arxiv

1+阅读 · 2023年6月6日

NNSplitter: An Active Defense Solution for DNN Model via Automated Weight Obfuscation

Arxiv

0+阅读 · 2023年6月3日

A Feature Reuse Framework with Texture-adaptive Aggregation for Reference-based Super-Resolution

Arxiv

0+阅读 · 2023年6月2日

Transformers in Time Series: A Survey

Arxiv

34+阅读 · 2022年2月15日

MVFNet: Multi-View Fusion Network for Efficient Video Recognition

Arxiv

13+阅读 · 2021年1月5日

Hierarchical Graph Pooling with Structure Learning

Arxiv

13+阅读 · 2019年11月14日

Unsupervised Cross-Modality Domain Adaptation of ConvNets for Biomedical Image Segmentations with Adversarial Loss

Arxiv

10+阅读 · 2018年4月29日

An application of cascaded 3D fully convolutional networks for medical image segmentation

Arxiv

10+阅读 · 2018年3月20日

相关基金

金属碳化物基低铂介孔催化材料的合成、界面设计与电催化性能研究

国家自然科学基金

0+阅读 · 2015年12月31日

3D多孔结构LiMnPO4•LiVPO4F@石墨烯气凝胶复合物材料的构筑及电化学性能研究

国家自然科学基金

0+阅读 · 2015年12月31日

多分辨率相机及图像超分辨率技术研究

国家自然科学基金

2+阅读 · 2014年12月31日

面向无线传感器网络的分布式压缩感知测量矩阵的优化设计

国家自然科学基金

1+阅读 · 2013年12月31日

基于二维导电网络的LiMnPO4体系@石墨烯材料的功能调控及其储锂特性

国家自然科学基金

0+阅读 · 2013年12月31日

“异维结构”光电器件的设计、制备及性能研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于SHMS的大跨悬索桥强/台风作用及其效应的精细化研究

国家自然科学基金

0+阅读 · 2009年12月31日

具有低温有序的[FePt/Au]10垂直磁记录介质材料的制备与表征

国家自然科学基金

0+阅读 · 2009年12月31日

基于动力学分析的Internet网络拥塞控制研究

国家自然科学基金

0+阅读 · 2009年12月31日

基于碳纳米管/石墨烯自组装复合结构的透明导电薄膜的研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员