The recently proposed FixMatch achieved state-of-the-art results on most semi-supervised learning (SSL) benchmarks. However, like other modern SSL algorithms, FixMatch uses a pre-defined constant threshold for all classes to select unlabeled data that contribute to the training, thus failing to consider different learning status and learning difficulties of different classes. To address this issue, we propose Curriculum Pseudo Labeling (CPL), a curriculum learning approach to leverage unlabeled data according to the model's learning status. The core of CPL is to flexibly adjust thresholds for different classes at each time step to let pass informative unlabeled data and their pseudo labels. CPL does not introduce additional parameters or computations (forward or backward propagation). We apply CPL to FixMatch and call our improved algorithm FlexMatch. FlexMatch achieves state-of-the-art performance on a variety of SSL benchmarks, with especially strong performances when the labeled data are extremely limited or when the task is challenging. For example, FlexMatch outperforms FixMatch by 14.32% and 24.55% on CIFAR-100 and STL-10 datasets respectively, when there are only 4 labels per class. CPL also significantly boosts the convergence speed, e.g., FlexMatch can use only 1/5 training time of FixMatch to achieve even better performance. Furthermore, we show that CPL can be easily adapted to other SSL algorithms and remarkably improve their performances. We open source our code at https://github.com/TorchSSL/TorchSSL.
翻译:最近提议的FixMatch在大多数半监督学习基准(SSL)上取得了最新的最新成果。 但是,与其他现代 SSL 算法一样,FixMatch对所有班级选择有助于培训的非标签数据使用预定义的常数阈值,从而无法考虑不同学习状况和不同班级的学习困难。为了解决这个问题,我们建议课程PsedoLabeling(CPL),这是根据模型的学习状况来利用无标签数据的一种课程学习方法。CPL的核心是灵活调整不同班级的门槛,让信息化的非标签数据及其假标签通过。 CPL不引入额外的参数或计算( 前向或后向传播 ) 。 我们将CPL 应用到 FlexMatch 中, 我们改进的算法在各种 SSL基准上达到最先进的表现, 当标签数据只有开放性能或有挑战性能。 例如, FlexMLSL 超过eferformass 的 eflexMatchs requist efrichmass by 14.32% and CIFL55-10L silvildalationalations.