Semi-supervised learning is an effective way to leverage massive amounts of unlabeled data. In this paper, we propose a novel training strategy, termed Semi-supervised Contrastive Learning (SsCL), which combines the well-known contrastive loss in self-supervised learning with the cross entropy loss in semi-supervised learning, and jointly optimizes the two objectives in an end-to-end way. The highlight is that, unlike self-training based semi-supervised learning that conducts prediction and retraining over the same model weights, SsCL interchanges the predictions over the unlabeled data between the two branches, thus forming a co-calibration procedure, which we find is beneficial for better prediction and for avoiding being trapped in local minima. Towards this goal, the contrastive loss branch models pairwise similarities among samples using the nearest neighbors generated from the cross entropy branch, and in turn calibrates the prediction distribution of the cross entropy branch with the contrastive similarity. We show that SsCL produces more discriminative representations and is beneficial to few-shot learning. Notably, on ImageNet with ResNet-50 as the backbone, SsCL achieves 60.2% and 72.1% top-1 accuracy with 1% and 10% labeled samples, respectively, which significantly outperforms the baseline and is better than previous semi-supervised and self-supervised methods.
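To make the joint objective concrete, the sketch below shows a minimal form of combining a supervised cross entropy term on labeled data with an InfoNCE-style contrastive term on two augmented views of unlabeled data. This is only an illustration of the end-to-end joint optimization described above; the function names, the temperature, and the weighting factor `lam` are assumptions for exposition, and it omits the co-calibration step (the nearest-neighbor exchange between the two branches) that is specific to SsCL.

```python
import torch
import torch.nn.functional as F


def info_nce_loss(z1, z2, temperature=0.1):
    """InfoNCE contrastive loss between two augmented views.

    z1, z2: (N, D) projections of two views of the same unlabeled batch;
    matching rows are treated as positive pairs, all others as negatives.
    """
    z1 = F.normalize(z1, dim=1)
    z2 = F.normalize(z2, dim=1)
    logits = z1 @ z2.t() / temperature                 # (N, N) pairwise similarities
    targets = torch.arange(z1.size(0), device=z1.device)
    return F.cross_entropy(logits, targets)


def joint_loss(logits_labeled, labels, z1, z2, lam=1.0):
    """Jointly optimize the supervised and contrastive objectives end-to-end.

    logits_labeled: classifier outputs on the labeled batch.
    z1, z2: contrastive projections of two views of the unlabeled batch.
    lam: trade-off between the two terms (hyperparameter, value illustrative).
    """
    ce = F.cross_entropy(logits_labeled, labels)       # supervised branch
    nce = info_nce_loss(z1, z2)                        # contrastive branch
    return ce + lam * nce
```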