Title: A soft nearest-neighbor framework for continual semi-supervised learning

Zhiqi Kang, Enrico Fini, Moin Nabi, Elisa Ricci, Karteek Alahari

From arXiv, 13 pages

Despite significant advances, the performance of state-of-the-art continual learning approaches hinges on the unrealistic scenario of fully labeled data. In this paper, we tackle this challenge and propose an approach for continual semi-supervised learning, a setting where not all the data samples are labeled. A primary issue in this scenario is that the model forgets representations of unlabeled data and overfits the labeled samples. We leverage the power of nearest-neighbor classifiers to nonlinearly partition the feature space and, thanks to their non-parametric nature, flexibly model the underlying data distribution. This enables the model to learn a strong representation for the current task and distill relevant information from previous tasks. We perform a thorough experimental evaluation and show that our method outperforms all existing approaches by large margins, setting a solid state of the art on the continual semi-supervised learning paradigm. For example, on CIFAR-100 we surpass several other methods even when using at least 30 times less supervision (0.8% vs. 25% of annotations). Finally, our method works well on both low- and high-resolution images and scales seamlessly to more complex datasets such as ImageNet-100. The code is publicly available at https://github.com/kangzhiq/NNCSL
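
The abstract does not spell out how the nearest-neighbor classifier is computed. As a rough illustration of the general idea behind a soft nearest-neighbor classifier, the minimal sketch below assumes cosine similarity, a softmax temperature of 0.1, and a tiny labeled support set. The function names, the `temperature` parameter, and the toy vectors are all hypothetical and are not taken from the NNCSL code.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit length (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


def soft_nearest_neighbor_predict(query, support_features, support_labels,
                                  num_classes, temperature=0.1):
    """Return class probabilities for `query` from a labeled support set.

    Assumption (not from the paper): similarities are cosine similarities,
    turned into weights by a temperature-scaled softmax. Each support sample
    then votes for its own class with that weight.
    """
    q = l2_normalize(query)
    sims = [sum(a * b for a, b in zip(q, l2_normalize(s)))
            for s in support_features]

    # Numerically stable softmax over the scaled similarities.
    scaled = [s / temperature for s in sims]
    top = max(scaled)
    weights = [math.exp(s - top) for s in scaled]
    total = sum(weights)

    # Aggregate the soft votes per class; no class-specific parameters are learned.
    probs = [0.0] * num_classes
    for w, label in zip(weights, support_labels):
        probs[label] += w / total
    return probs


# Toy usage: two labeled samples per class in a 2-D feature space.
support = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
labels = [0, 0, 1, 1]
probs = soft_nearest_neighbor_predict([0.8, 0.2], support, labels, num_classes=2)
print(probs)  # class 0 receives most of the probability mass
```

Because the prediction depends only on distances to stored labeled samples, the decision boundary can follow the shape of the data instead of a fixed linear layer, which is the non-parametric property the abstract refers to.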
