空间噪音课程学习培训前 (Contrastive Object-level Pre-training with Spatial Noise Curriculum Learning)

The goal of contrastive learning based pre-training is to leverage large quantities of unlabeled data to produce a model that can be readily adapted downstream. Current approaches revolve around solving an image discrimination task: given an anchor image, an augmented counterpart of that image, and some other images, the model must produce representations such that the distance between the anchor and its counterpart is small, and the distances between the anchor and the other images are large. There are two significant problems with this approach: (i) by contrasting representations at the image-level, it is hard to generate detailed object-sensitive features that are beneficial to downstream object-level tasks such as instance segmentation; (ii) the augmentation strategy of producing an augmented counterpart is fixed, making learning less effective at the later stages of pre-training. In this work, we introduce Curricular Contrastive Object-level Pre-training (CCOP) to tackle these problems: (i) we use selective search to find rough object regions and use them to build an inter-image object-level contrastive loss and an intra-image object-level discrimination loss into our pre-training objective; (ii) we present a curriculum learning mechanism that adaptively augments the generated regions, which allows the model to consistently acquire a useful learning signal, even in the later stages of pre-training. Our experiments show that our approach improves on the MoCo v2 baseline by a large margin on multiple object-level tasks when pre-training on multi-object scene image datasets. Code is available at https://github.com/ChenhongyiYang/CCOP.

翻译：对比式学习前培训的目的是利用大量未贴标签的数据来制作一个能够随时适应下游的模型。当前的方法围绕解决图像歧视任务:如果有一个锚图像,该图像的放大对应方,以及其他一些图像,模型必须产生表征,使锚与其对应方之间的距离小,锚与其他图像之间的距离大。这种方法有两个重大问题:(一)通过图像层面的对比演示,很难产生有利于下游目标层面任务的详细的对目标敏感特性,例如实例分割;(二)制作一个强化对应方的增强战略是固定的,使得在培训前的后期阶段学习效果较差。在这项工作中,我们引入了与目标水平相抗的立体距离,而锁定目标与其他图像之间的距离也很大。这种方法有两个重大问题:(一) 我们使用选择性搜索来寻找粗糙的物体区域,并利用它们将目标层面的对比性损失和内部目标层面的歧视损失构建到我们的培训前目标层面;(二) 我们展示一个强化对应方的增强战略对应方的增强战略的强化战略,在培训前的后期学习阶段里,我们不断学习一个大层次。