Modern video object segmentation (VOS) algorithms have achieved remarkably high performance in a sequential processing order, while most of currently prevailing pipelines still show some obvious inadequacy like accumulative error, unknown robustness or lack of proper interpretation tools. In this paper, we place the semi-supervised video object segmentation problem into a cyclic workflow and find the defects above can be collectively addressed via the inherent cyclic property of semi-supervised VOS systems. Firstly, a cyclic mechanism incorporated to the standard sequential flow can produce more consistent representations for pixel-wise correspondance. Relying on the accurate reference mask in the starting frame, we show that the error propagation problem can be mitigated. Next, a simple gradient correction module, which naturally extends the offline cyclic pipeline to an online manner, can highlight the high-frequent and detailed part of results to further improve the segmentation quality while keeping feasible computation cost. Meanwhile such correction can protect the network from severe performance degration resulted from interference signals. Finally we develop cycle effective receptive field (cycle-ERF) based on gradient correction process to provide a new perspective into analyzing object-specific regions of interests. We conduct comprehensive comparison and detailed analysis on challenging benchmarks of DAVIS16, DAVIS17 and Youtube-VOS, demonstrating that the cyclic mechanism is helpful to enhance segmentation quality, improve the robustness of VOS systems, and further provide qualitative comparison and interpretation on how different VOS algorithms work. The code of this project can be found at https://github.com/lyxok1/STM-Training
翻译:现代视频目标分割法(VOS)在连续处理顺序中取得了非常高的性能,而目前大多数管道仍显示某些明显不足,如累积错误、未知强度或缺乏适当的解释工具。在本文件中,我们将半监督视频目标分割问题置于循环工作流程中,发现上述缺陷可以通过半监督VOS系统固有的循环特性集体解决。首先,纳入标准连续流的循环机制可以产生更一致的比素顺差表示。在起始框架的准确参考掩码上,我们表明错误传播问题可以减轻。接下来,一个简单的梯度修正模块,将离线视频目标分割问题自然扩展到在线方式,可以突出高反复和详细的结果部分,以进一步提高半监督VOS系统固有的分解特性,同时保持可行的计算成本。同时,这种修正可以保护网络不受标准相干干涉信号造成的严重性分解影响。最后,我们开发了基于梯度校校校校正的循环有效接收场(循环-ERF),我们展示了错误传播问题传播问题减少。一个简单的梯度校正校正校正校准模块,将自动将离子分析系统的质量分级分析目标系统。