Consistency regularization on label predictions has become a fundamental technique in semi-supervised learning, but it still requires a large number of training iterations to reach high performance. In this study, we show that consistency regularization restricts the propagation of label information because samples with unconfident pseudo-labels are excluded from model updates. We then propose contrastive regularization, which improves both the efficiency and the accuracy of consistency regularization through well-clustered features of unlabeled data. Specifically, after strongly augmented samples are assigned to clusters by their pseudo-labels, our contrastive regularization updates the model so that features with confident pseudo-labels aggregate the features in the same cluster while pushing away features in different clusters. As a result, the well-clustered features allow the information of confident pseudo-labels to propagate effectively to more unlabeled samples during training. On semi-supervised learning benchmarks, our contrastive regularization improves upon previous consistency-based methods and achieves state-of-the-art results, especially with fewer training iterations. Our method also shows robust performance on open-set semi-supervised learning, where the unlabeled data includes out-of-distribution samples.
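To make the mechanism concrete, the sketch below shows one plausible form of such a class-aware contrastive loss in PyTorch: features of strongly augmented samples are grouped by pseudo-label, and only anchors whose pseudo-labels exceed a confidence threshold pull same-cluster features together while pushing other clusters apart. This is a minimal illustration under our own assumptions, not the paper's implementation; the function name `contrastive_regularization` and its parameters (`threshold`, `temperature`) are hypothetical.

```python
import torch
import torch.nn.functional as F

def contrastive_regularization(features, pseudo_labels, confidence,
                               threshold=0.95, temperature=0.1):
    """Hypothetical sketch of a class-aware contrastive loss.

    features:      (N, D) projections of strongly augmented samples
    pseudo_labels: (N,)   hard pseudo-labels (e.g., from weak augmentations)
    confidence:    (N,)   max softmax probability of each pseudo-label
    """
    z = F.normalize(features, dim=1)
    sim = z @ z.t() / temperature                       # pairwise similarities

    # Exclude self-similarity from both positives and the normalizer.
    not_self = ~torch.eye(len(z), dtype=torch.bool, device=z.device)
    # Positives: pairs assigned to the same pseudo-label cluster.
    pos_mask = (pseudo_labels.unsqueeze(0) == pseudo_labels.unsqueeze(1)) & not_self

    # Log-softmax over all other samples in the batch.
    sim = sim.masked_fill(~not_self, float('-inf'))
    log_prob = sim - torch.logsumexp(sim, dim=1, keepdim=True)
    log_prob = log_prob.masked_fill(~not_self, 0.0)     # avoid 0 * (-inf)

    # Mean log-likelihood of positives per anchor (pull same cluster together;
    # the softmax denominator pushes away other clusters).
    pos_count = pos_mask.sum(dim=1).clamp(min=1)
    loss_per_anchor = -(log_prob * pos_mask.float()).sum(dim=1) / pos_count

    # Only anchors with confident pseudo-labels act as anchors.
    confident = (confidence >= threshold).float()
    return (loss_per_anchor * confident).sum() / confident.sum().clamp(min=1.0)
```

Under this formulation, unconfident samples still receive gradient as positives or negatives of confident anchors, which is one way the label information of confident pseudo-labels can spread to samples that consistency regularization alone would ignore.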