Continual learning in real-world scenarios is a major challenge. A general continual learning model should have a constant memory size and no predefined task boundaries, as is the case in semi-supervised Video Object Segmentation (VOS), where continual learning challenges particularly present themselves in working on long video sequences. In this article, we first formulate the problem of semi-supervised VOS, specifically online VOS, as a continual learning problem, and then secondly provide a public VOS dataset, CLVOS23, focusing on continual learning. Finally, we propose and implement a regularization-based continual learning approach on LWL, an existing online VOS baseline, to demonstrate the efficacy of continual learning when applied to online VOS and to establish a CLVOS23 baseline. We apply the proposed baseline to the Long Videos dataset as well as to two short video VOS datasets, DAVIS16 and DAVIS17. To the best of our knowledge, this is the first time that VOS has been defined and addressed as a continual learning problem.
翻译:连续学习在现实场景中是一个重大的挑战。一个通用的连续学习模型应该具有恒定的内存大小和无预定义的任务边界,就像半监督视频物体分割(VOS)中的情况一样,其中在处理长时间视频序列时,连续学习的挑战特别明显。在本文中,我们首先将半监督VOS问题,特别是在线VOS,形式化为持续学习问题,然后提供一个公开的VOS数据集,CLVOS23,重点关注连续学习。最后,我们提出并实现了一种基于正则化的连续学习方法,将其应用于LWL,一个现有的在线VOS基线上,以证明连续学习在应用于在线VOS时的有效性,并建立了CLVOS23的基线。我们将所提出的基线应用于长视频数据集以及两个短视频VOS数据集DAVIS16和DAVIS17。据我们所知,这是第一次将VOS定义和解决为连续学习问题。