Labels are costly and sometimes unreliable. Noisy label learning, semi-supervised learning, and contrastive learning are three different strategies for designing learning processes requiring less annotation cost. Semi-supervised learning and contrastive learning have been recently demonstrated to improve learning strategies that address datasets with noisy labels. Still, the inner connections between these fields as well as the potential to combine their strengths together have only started to emerge. In this paper, we explore further ways and advantages to fuse them. Specifically, we propose CSSL, a unified Contrastive Semi-Supervised Learning algorithm, and CoDiM (Contrastive DivideMix), a novel algorithm for learning with noisy labels. CSSL leverages the power of classical semi-supervised learning and contrastive learning technologies and is further adapted to CoDiM, which learns robustly from multiple types and levels of label noise. We show that CoDiM brings consistent improvements and achieves state-of-the-art results on multiple benchmarks.
翻译:标签是昂贵的,有时是不可靠的。噪音标签学习、半监督的学习和对比学习是设计学习过程的三种不同的战略,需要较少的注解费用。半监督的学习和对比学习最近被证明是用来改进学习战略,用吵闹的标签处理数据集的半监督的学习和对比学习。然而,这些领域之间的内在联系以及将它们的优势结合在一起的潜力只是刚刚开始出现。在本文中,我们探索了进一步的方法和优势来融合它们。具体地说,我们提出了CSSL,一个统一的对抗半监督学习算法和CoDiM(CoDIM),一个用吵闹的标签进行学习的新型算法。CSSL利用传统的半监督学习和对比学习技术的力量,并进一步适应CDIM,后者从多种类型和等级的标签噪音中有力地学习。我们显示,CDIM带来一致的改进,并在多个基准上取得最新的结果。