实时视频截断功能的扭曲软件网络预视和功能再使用 (Distortion-Aware Network Pruning and Feature Reuse for Real-time Video Segmentation)

Real-time video segmentation is a crucial task for many real-world applications such as autonomous driving and robot control. Since state-of-the-art semantic segmentation models are often too heavy for real-time applications despite their impressive performance, researchers have proposed lightweight architectures with speed-accuracy trade-offs, achieving real-time speed at the expense of reduced accuracy. In this paper, we propose a novel framework to speed up any architecture with skip-connections for real-time vision tasks by exploiting the temporal locality in videos. Specifically, at the arrival of each frame, we transform the features from the previous frame to reuse them at specific spatial bins. We then perform partial computation of the backbone network on the regions of the current frame that captures temporal differences between the current and previous frame. This is done by dynamically dropping out residual blocks using a gating mechanism which decides which blocks to drop based on inter-frame distortion. We validate our Spatial-Temporal Mask Generator (STMG) on video semantic segmentation benchmarks with multiple backbone networks, and show that our method largely speeds up inference with minimal loss of accuracy.

翻译：实时视频分割是许多现实世界应用的关键任务,例如自主驾驶和机器人控制。由于最新语义分割模型尽管性能令人印象深刻,但往往对实时应用来说过于沉重,因此研究人员提出了具有速度-准确取舍的轻量结构,以降低准确性为代价实现了实时速度。在本文中,我们提出了一个新框架,通过利用视频中的时间定位点,加速任何具有跳过连接的实时视觉任务结构。具体地说,在每个框架到达时,我们从以前的框架转换特征,再利用它们到特定的空间文件箱。我们随后对当前框架区域的主干网进行部分计算,以捕捉到当前框架和以往框架之间的时间差异。这是通过使用一种定位机制,即决定哪些区块会因跨框架扭曲而下降,从而动态地将残余块扔出去。我们用多个主干网的视频语义分割基准验证了我们的空间-时态面具生成器(STMG),并表明我们的方法在很大程度上加快了推断速度,从而降低了准确性。

相关内容

Networking

关注 22

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【CVPR 2022】基于windows的图像压缩注意，The Devil Is in the Details: Window-based Attention for Image Compression

专知会员服务

8+阅读 · 2022年3月12日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

专知会员服务

50+阅读 · 2020年2月26日