One fundamental challenge in building an instance segmentation model for a large number of classes in complex scenes is the lack of training examples, especially for rare objects. In this paper, we explore the possibility to increase the training examples without laborious data collection and annotation. We find that an abundance of instance segments can potentially be obtained freely from object-centric im-ages, according to two insights: (i) an object-centric image usually contains one salient object in a simple background; (ii) objects from the same class often share similar appearances or similar contrasts to the background. Motivated by these insights, we propose a simple and scalable framework FreeSeg for extracting and leveraging these "free" object foreground segments to facilitate model training in long-tailed instance segmentation. Concretely, we employ off-the-shelf object foreground extraction techniques (e.g., image co-segmentation) to generate instance mask candidates, followed by segments refinement and ranking. The resulting high-quality object segments can be used to augment the existing long-tailed dataset, e.g., by copying and pasting the segments onto the original training images. On the LVIS benchmark, we show that FreeSeg yields substantial improvements on top of strong baselines and achieves state-of-the-art accuracy for segmenting rare object categories.
翻译:在复杂的场景中,为大量班级构建一个实例分解模型的基本挑战之一是缺乏培训实例,特别是稀有物品。在本文中,我们探讨是否有可能在不费力收集数据和注释的情况下增加培训实例。我们发现,根据两个洞察力,大量实例区段可能可以自由从以物体为中心的低龄地区获得,具体地说,我们使用现成物体表面提取技术(例如图像共比缩放)生成试样掩体对象,然后是部分精细和排序。由此产生的高质量对象区段可以用来增加现有的长尾数据集,例如,通过复制和粘贴原始图像的原始分层,在原始的S级上实现原始的原始分层的原始分层。