Recent advances in semantic segmentation generally adopt an ImageNet-pretrained backbone and append a special context module after it to rapidly increase the field-of-view. Although successful, this design means the backbone, where most of the computation lies, does not have a large enough field-of-view to make the best decisions. Some recent approaches tackle this problem by rapidly downsampling the resolution in the backbone while also maintaining one or more parallel branches at higher resolutions. We take a different approach: we design a ResNeXt-inspired block structure that uses two parallel 3x3 convolutional layers with different dilation rates to increase the field-of-view while preserving local details. By repeating this block structure throughout the backbone, we do not need to append any special context module after it. In addition, we propose a lightweight decoder that restores local information better than common alternatives. To demonstrate the effectiveness of our approach, our model RegSeg achieves state-of-the-art results on the real-time Cityscapes and CamVid benchmarks. Using a T4 GPU with mixed precision, RegSeg achieves 78.3 mIOU on the Cityscapes test set at 30 FPS, and 80.9 mIOU on the CamVid test set at 70 FPS, both without ImageNet pretraining.
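The core idea above — that 3x3 convolutions with larger dilation rates enlarge the field-of-view without extra parameters, while a parallel dilation-1 branch preserves local detail — can be illustrated with a small receptive-field calculator. This is a sketch for intuition, not code from the paper; the function name and the `(stride, dilation)` layer encoding are our own.

```python
# Hypothetical receptive-field calculator (not from the RegSeg paper).
# Shows how stacking 3x3 convs with growing dilation rates expands the
# field-of-view far faster than plain 3x3 convs of the same depth.
def receptive_field(layers, kernel=3):
    """layers: list of (stride, dilation) tuples, applied in order.
    Returns the receptive field size (in input pixels) of one output pixel."""
    rf, jump = 1, 1  # jump = cumulative stride (output-pixel spacing in the input)
    for stride, dilation in layers:
        # An effective kernel of size (kernel - 1) * dilation + 1 widens the
        # receptive field by its extent, scaled by the cumulative stride.
        rf += (kernel - 1) * dilation * jump
        jump *= stride
    return rf

# Four plain 3x3 convs vs. four 3x3 convs with dilation rates 1, 2, 4, 8:
plain = receptive_field([(1, 1)] * 4)                         # -> 9
dilated = receptive_field([(1, 1), (1, 2), (1, 4), (1, 8)])   # -> 31
print(plain, dilated)
```

In a block with two parallel 3x3 branches, the dilation-1 branch keeps the 3x3 local footprint while the larger-dilation branch contributes the expanded context, which is why repeating such blocks can replace a dedicated context module.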