Semantic segmentation usually benefits from global context, fine localisation information, multi-scale features, and so on. To advance Transformer-based segmenters along these lines, we present a simple yet powerful semantic segmentation architecture, termed IncepFormer. IncepFormer makes two key contributions. First, it introduces a novel pyramid-structured Transformer encoder that harvests global context and fine localisation features simultaneously. These features are concatenated and fed into a convolution layer for final per-pixel prediction. Second, IncepFormer integrates an Inception-like architecture with depth-wise convolutions, together with a light-weight feed-forward module, into each self-attention layer, efficiently obtaining rich local multi-scale object features. Extensive experiments on five benchmarks show that IncepFormer is superior to state-of-the-art methods in both accuracy and speed: 1) IncepFormer-S achieves 47.7% mIoU on ADE20K, outperforming the existing best method by 1% while using only half the parameters and fewer FLOPs; 2) IncepFormer-B achieves 82.0% mIoU on the Cityscapes dataset with 39.6M parameters. Code is available at github.com/shendu0321/IncepFormer.
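To make the "Inception-like architecture with depth-wise convolutions" concrete, here is a minimal NumPy sketch of the idea: several depth-wise convolutions with different kernel sizes run in parallel on the same feature map, their outputs are concatenated along the channel axis, and a 1x1 (pointwise) projection fuses them back to the original channel count. This is only an illustrative sketch with random weights, not the paper's actual module; the kernel sizes and all function names are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def depthwise_conv(x, weights):
    """Same-padded depth-wise convolution: one k x k kernel per channel.

    x: feature map of shape (C, H, W); weights: (C, k, k), k odd.
    """
    C, H, W = x.shape
    k = weights.shape[-1]
    p = k // 2
    xp = np.pad(x, ((0, 0), (p, p), (p, p)))
    out = np.empty_like(x)
    for c in range(C):                 # each channel gets its own kernel
        for i in range(H):
            for j in range(W):
                out[c, i, j] = np.sum(xp[c, i:i + k, j:j + k] * weights[c])
    return out

def inception_dw_block(x, kernel_sizes=(3, 5, 7)):
    """Inception-style multi-branch depth-wise convs (multi-scale local
    context), concatenated on channels, fused by a 1x1 projection."""
    C, H, W = x.shape
    branches = []
    for k in kernel_sizes:
        w = rng.standard_normal((C, k, k)) * 0.1   # random stand-in weights
        branches.append(depthwise_conv(x, w))
    cat = np.concatenate(branches, axis=0)          # (len(ks) * C, H, W)
    w_pw = rng.standard_normal((C, cat.shape[0])) * 0.1
    fused = np.einsum('oc,chw->ohw', w_pw, cat)     # 1x1 conv back to C channels
    return fused

x = rng.standard_normal((4, 8, 8))
y = inception_dw_block(x)
print(y.shape)  # (4, 8, 8): same spatial size and channel count as the input
```

Each branch sees a different receptive field (3x3, 5x5, 7x7 here), which is how the block gathers local features at multiple scales before the pointwise fusion; depth-wise kernels keep the cost low because each kernel touches only one channel.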