PIDNet:一种受PID控制器启发的实时语义分割网络 (PIDNet: A Real-time Semantic Segmentation Network Inspired by PID Controllers) - 专知论文

会员服务 ·

0

PID · 上下文 · PID控制 · 超调 · 控制器 ·

2023 年 4 月 7 日

PIDNet: A Real-time Semantic Segmentation Network Inspired by PID Controllers

翻译：PIDNet:一种受PID控制器启发的实时语义分割网络

Jiacong Xu,Zixiang Xiong,Shankar P. Bhattacharyya

from arxiv, 11 pages, 9 figures; This paper will be published by CVPR2023 soon, please refer to the official version then

Two-branch network architecture has shown its efficiency and effectiveness in real-time semantic segmentation tasks. However, direct fusion of high-resolution details and low-frequency context has the drawback of detailed features being easily overwhelmed by surrounding contextual information. This overshoot phenomenon limits the improvement of the segmentation accuracy of existing two-branch models. In this paper, we make a connection between Convolutional Neural Networks (CNN) and Proportional-Integral-Derivative (PID) controllers and reveal that a two-branch network is equivalent to a Proportional-Integral (PI) controller, which inherently suffers from similar overshoot issues. To alleviate this problem, we propose a novel three-branch network architecture: PIDNet, which contains three branches to parse detailed, context and boundary information, respectively, and employs boundary attention to guide the fusion of detailed and context branches. Our family of PIDNets achieve the best trade-off between inference speed and accuracy and their accuracy surpasses all the existing models with similar inference speed on the Cityscapes and CamVid datasets. Specifically, PIDNet-S achieves 78.6% mIOU with inference speed of 93.2 FPS on Cityscapes and 80.1% mIOU with speed of 153.7 FPS on CamVid.

翻译：两分支网络结构已经在实时语义分割任务中显示出其效率和有效性。然而，高分辨率细节和低频上下文直接融合的缺点是详情特征很容易被周围的上下文信息淹没。这种超调现象限制了现有的两分支模型分割精度的提高。在本文中，我们建立了卷积神经网络（CNN）和比例-积分-微分（PID）控制器之间的联系，并揭示了两分支网络等效于比例-积分（PI）控制器，其本质上具有类似的超调问题。为了减轻这个问题，我们提出了一种新颖的三分支网络结构：PIDNet，它包含三个分支，分别解析详细信息、上下文信息和边界信息，并采用边界关注指导详细分支和上下文分支的融合。我们的PIDNet系列在推理速度和准确性之间取得了最佳折中点，并且它们的准确性超过了所有类似推理速度的现有模型，在Cityscapes和CamVid数据集上均如此。具体而言，PIDNet-S在Cityscapes上的推理速度为93.2 FPS，mIOU为78.6％；在CamVid上的推理速度为153.7 FPS，mIOU为80.1％。

0

相关内容

PID

【TNNLS2022】SGCPNet: 面向实时语义分割的空间细节引导上下文传播网络

【TNNLS2022】SGCPNet: 面向实时语义分割的空间细节引导上下文传播网络

专知会员服务

24+阅读 · 2022年4月8日

【CVPR 2022】基于实例深度估计的统一深度感知全景分割 PanopticDepth: Per-Instance Depth Estimation for Unified Depth-Aware Panoptic Segmentation

【CVPR 2022】基于实例深度估计的统一深度感知全景分割 PanopticDepth: Per-Instance Depth Estimation for Unified Depth-Aware Panoptic Segmentation

专知会员服务

18+阅读 · 2022年3月19日

【CVPR 2022】MixFormer：跨窗口与维度的特征融合，MixFormer: Mixing Features across Windows and Dimensions

【CVPR 2022】MixFormer：跨窗口与维度的特征融合，MixFormer: Mixing Features across Windows and Dimensions

专知会员服务

15+阅读 · 2022年3月19日

[ICCV 2021] 从二到一：一种带有视觉语言建模网络的新场景文本识别器

专知会员服务

17+阅读 · 2021年10月17日

20篇「ICCV2021 Oral」最新论文抢先看！看当下计算机视觉在研究什么？

20篇「ICCV2021 Oral」最新论文抢先看！看当下计算机视觉在研究什么？

专知会员服务

62+阅读 · 2021年7月30日

【CVPR2020】用于图像超分辨率的深度展开网络，Deep Unfolding Network for Image Super-Resolution

【CVPR2020】用于图像超分辨率的深度展开网络，Deep Unfolding Network for Image Super-Resolution

专知会员服务

44+阅读 · 2020年3月26日

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

专知会员服务

50+阅读 · 2020年2月26日

实时强化学习《Real-Time Reinforcement Learning》S Ramstedt, C Pal [Mila, Element AI] (2019)

实时强化学习《Real-Time Reinforcement Learning》S Ramstedt, C Pal [Mila, Element AI] (2019)

专知会员服务

13+阅读 · 2019年11月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

一文带你读懂 DeconvNet 上采样层（语义分割）

一文带你读懂 DeconvNet 上采样层（语义分割）

AI研习社

26+阅读 · 2019年3月16日

TorchSeg：基于pytorch的语义分割算法开源了

TorchSeg：基于pytorch的语义分割算法开源了

极市平台

20+阅读 · 2019年1月28日

【泡泡一分钟】用于RGBD语义分割的三维图神经网络(ICCV2017-546)

【泡泡一分钟】用于RGBD语义分割的三维图神经网络(ICCV2017-546)

泡泡机器人SLAM

22+阅读 · 2018年12月4日

《pyramid Attention Network for Semantic Segmentation》

《pyramid Attention Network for Semantic Segmentation》

统计学习与视觉计算组

44+阅读 · 2018年8月30日

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

专知

27+阅读 · 2018年2月7日

【论文推荐】最新5篇目标跟踪（Object Tracking）相关论文—并行跟踪和验证、光流、自动跟踪、相关滤波集成、CFNet

【论文推荐】最新5篇目标跟踪（Object Tracking）相关论文—并行跟踪和验证、光流、自动跟踪、相关滤波集成、CFNet

专知

25+阅读 · 2018年2月6日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

【推荐】全卷积语义分割综述

【推荐】全卷积语义分割综述

机器学习研究会

19+阅读 · 2017年8月31日

带有噪声扰动的动力系统分支问题研究

国家自然科学基金

0+阅读 · 2015年12月31日

智能电网中ZigBee网络的实时拓扑优化和高效广播传输算法研究

国家自然科学基金

1+阅读 · 2013年12月31日

有理映射的参数空间

国家自然科学基金

0+阅读 · 2013年12月31日

片上网络的高效拥塞感知及协同拥塞控制方法研究

国家自然科学基金

1+阅读 · 2013年12月31日

动态多摄像头环境中拥挤多目标跟踪的联合建模与协同优化

国家自然科学基金

0+阅读 · 2013年12月31日

非线性系统全局输出反馈：镇定、跟踪和应用

国家自然科学基金

0+阅读 · 2012年12月31日

确保网络化多机器人协同跟踪的实时通信条件研究

国家自然科学基金

0+阅读 · 2012年12月31日

缆系式紧耦合多机器人系统协调建模及稳定性分析

国家自然科学基金

0+阅读 · 2012年12月31日

自主机器人基于全景视觉的大范围未知环境归航方法研究

国家自然科学基金

0+阅读 · 2011年12月31日

模块化非线性系统辨识

国家自然科学基金

0+阅读 · 2011年12月31日

GRAtt-VIS: Gated Residual Attention for Auto Rectifying Video Instance Segmentation

Arxiv

0+阅读 · 2023年5月26日

On the Robustness of Segment Anything

Arxiv

0+阅读 · 2023年5月25日

MaxViT-UNet: Multi-Axis Attention for Medical Image Segmentation

Arxiv

0+阅读 · 2023年5月25日

GTNet: Graph Transformer Network for 3D Point Cloud Classification and Semantic Segmentation

Arxiv

0+阅读 · 2023年5月24日

MMNet: Multi-Mask Network for Referring Image Segmentation

Arxiv

0+阅读 · 2023年5月24日

End-to-End Video Instance Segmentation with Transformers

Arxiv

10+阅读 · 2021年3月24日

Image Segmentation Using Deep Learning: A Survey

Arxiv

17+阅读 · 2020年11月15日

Linkage Based Face Clustering via Graph Convolution Network

Arxiv

16+阅读 · 2019年3月27日

nnU-Net: Self-adapting Framework for U-Net-Based Medical Image Segmentation

Arxiv

12+阅读 · 2018年9月27日

End-to-End Multi-Task Learning with Attention

Arxiv

19+阅读 · 2018年3月28日

VIP会员

文章信息

相关主题

相关VIP内容

【TNNLS2022】SGCPNet: 面向实时语义分割的空间细节引导上下文传播网络

【TNNLS2022】SGCPNet: 面向实时语义分割的空间细节引导上下文传播网络

专知会员服务

24+阅读 · 2022年4月8日

【CVPR 2022】基于实例深度估计的统一深度感知全景分割 PanopticDepth: Per-Instance Depth Estimation for Unified Depth-Aware Panoptic Segmentation

【CVPR 2022】基于实例深度估计的统一深度感知全景分割 PanopticDepth: Per-Instance Depth Estimation for Unified Depth-Aware Panoptic Segmentation

专知会员服务

18+阅读 · 2022年3月19日

【CVPR 2022】MixFormer：跨窗口与维度的特征融合，MixFormer: Mixing Features across Windows and Dimensions

【CVPR 2022】MixFormer：跨窗口与维度的特征融合，MixFormer: Mixing Features across Windows and Dimensions

专知会员服务

15+阅读 · 2022年3月19日

[ICCV 2021] 从二到一：一种带有视觉语言建模网络的新场景文本识别器

专知会员服务

17+阅读 · 2021年10月17日

20篇「ICCV2021 Oral」最新论文抢先看！看当下计算机视觉在研究什么？

20篇「ICCV2021 Oral」最新论文抢先看！看当下计算机视觉在研究什么？

专知会员服务

62+阅读 · 2021年7月30日

【CVPR2020】用于图像超分辨率的深度展开网络，Deep Unfolding Network for Image Super-Resolution

【CVPR2020】用于图像超分辨率的深度展开网络，Deep Unfolding Network for Image Super-Resolution

专知会员服务

44+阅读 · 2020年3月26日

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

专知会员服务

50+阅读 · 2020年2月26日

实时强化学习《Real-Time Reinforcement Learning》S Ramstedt, C Pal [Mila, Element AI] (2019)

实时强化学习《Real-Time Reinforcement Learning》S Ramstedt, C Pal [Mila, Element AI] (2019)

专知会员服务

13+阅读 · 2019年11月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

《2024年度美国防部作战测试与评估报告》500页

《面相未来作战空中系统中有人-无人编组的AI驱动协作模式选择》含slides

无人机编队飞行：复杂环境中作战的策略、挑战与应用

《探索军事背景下共享大语言模型：AI助手与智能体部署中可扩展性与效率的早期洞察》（含44页slides）

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

一文带你读懂 DeconvNet 上采样层（语义分割）

一文带你读懂 DeconvNet 上采样层（语义分割）

AI研习社

26+阅读 · 2019年3月16日

TorchSeg：基于pytorch的语义分割算法开源了

TorchSeg：基于pytorch的语义分割算法开源了

极市平台

20+阅读 · 2019年1月28日

【泡泡一分钟】用于RGBD语义分割的三维图神经网络(ICCV2017-546)

【泡泡一分钟】用于RGBD语义分割的三维图神经网络(ICCV2017-546)

泡泡机器人SLAM

22+阅读 · 2018年12月4日

《pyramid Attention Network for Semantic Segmentation》

《pyramid Attention Network for Semantic Segmentation》

统计学习与视觉计算组

44+阅读 · 2018年8月30日

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

专知

27+阅读 · 2018年2月7日

【论文推荐】最新5篇目标跟踪（Object Tracking）相关论文—并行跟踪和验证、光流、自动跟踪、相关滤波集成、CFNet

【论文推荐】最新5篇目标跟踪（Object Tracking）相关论文—并行跟踪和验证、光流、自动跟踪、相关滤波集成、CFNet

专知

25+阅读 · 2018年2月6日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

【推荐】全卷积语义分割综述

【推荐】全卷积语义分割综述

机器学习研究会

19+阅读 · 2017年8月31日

相关论文

GRAtt-VIS: Gated Residual Attention for Auto Rectifying Video Instance Segmentation

Arxiv

0+阅读 · 2023年5月26日

On the Robustness of Segment Anything

Arxiv

0+阅读 · 2023年5月25日

MaxViT-UNet: Multi-Axis Attention for Medical Image Segmentation

Arxiv

0+阅读 · 2023年5月25日

GTNet: Graph Transformer Network for 3D Point Cloud Classification and Semantic Segmentation

Arxiv

0+阅读 · 2023年5月24日

MMNet: Multi-Mask Network for Referring Image Segmentation

Arxiv

0+阅读 · 2023年5月24日

End-to-End Video Instance Segmentation with Transformers

Arxiv

10+阅读 · 2021年3月24日

Image Segmentation Using Deep Learning: A Survey

Arxiv

17+阅读 · 2020年11月15日

Linkage Based Face Clustering via Graph Convolution Network

Arxiv

16+阅读 · 2019年3月27日

nnU-Net: Self-adapting Framework for U-Net-Based Medical Image Segmentation

Arxiv

12+阅读 · 2018年9月27日

End-to-End Multi-Task Learning with Attention

Arxiv

19+阅读 · 2018年3月28日

相关基金

带有噪声扰动的动力系统分支问题研究

国家自然科学基金

0+阅读 · 2015年12月31日

智能电网中ZigBee网络的实时拓扑优化和高效广播传输算法研究

国家自然科学基金

1+阅读 · 2013年12月31日

有理映射的参数空间

国家自然科学基金

0+阅读 · 2013年12月31日

片上网络的高效拥塞感知及协同拥塞控制方法研究

国家自然科学基金

1+阅读 · 2013年12月31日

动态多摄像头环境中拥挤多目标跟踪的联合建模与协同优化

国家自然科学基金

0+阅读 · 2013年12月31日

非线性系统全局输出反馈：镇定、跟踪和应用

国家自然科学基金

0+阅读 · 2012年12月31日

确保网络化多机器人协同跟踪的实时通信条件研究

国家自然科学基金

0+阅读 · 2012年12月31日

缆系式紧耦合多机器人系统协调建模及稳定性分析

国家自然科学基金

0+阅读 · 2012年12月31日

自主机器人基于全景视觉的大范围未知环境归航方法研究

国家自然科学基金

0+阅读 · 2011年12月31日

模块化非线性系统辨识

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员