Semi-supervised learning (SSL) has been widely explored in recent years, and it is an effective way of leveraging unlabeled data to reduce the reliance on labeled data. In this work, we adjust neural processes (NPs) to the semi-supervised image classification task, resulting in a new method named NP-Match. NP-Match is suited to this task for two reasons. Firstly, NP-Match implicitly compares data points when making predictions, and as a result, the prediction of each unlabeled data point is affected by the labeled data points that are similar to it, which improves the quality of pseudo-labels. Secondly, NP-Match is able to estimate uncertainty that can be used as a tool for selecting unlabeled samples with reliable pseudo-labels. Compared with uncertainty-based SSL methods implemented with Monte-Carlo (MC) dropout, NP-Match estimates uncertainty with much less computational overhead, which can save time at both the training and the testing phases. We conducted extensive experiments on five public datasets under three semi-supervised image classification settings, namely, the standard semi-supervised image classification, the imbalanced semi-supervised image classification, and the multi-label semi-supervised image classification, and NP-Match outperforms state-of-the-art (SOTA) approaches or achieves competitive results on them, which shows the effectiveness of NP-Match and its potential for SSL. The codes are at https://github.com/Jianf-Wang/NP-Match
翻译:近些年来,人们广泛探索了半监督学习(SSL),这是利用未贴标签的数据来减少对标签数据的依赖的有效方法。 在这项工作中,我们调整神经过程(NP)以适应半监督图像分类任务,从而产生名为NP-Match的新方法。 NP-Match与这项任务相适应,原因有二。 首先,NP-Match在作出预测时暗含地比较数据点,结果,每个未贴标签的数据点的预测受到类似标签的数据点的影响,这提高了假标签的质量。 其次,NP-Match能够估计不确定性,而这种不确定性可以作为一种工具,用可靠的假标签来选择未贴标签的样本。 比较与Monte-Carlo(MC)的辍学一起实施的基于不确定性的 SSLSL方法, NP-Match估计不确定性,而计算间接费用则要少得多,这可以在培训和测试阶段节省时间。 我们在三种半监督的半监督的图像分类、半监督的S-NP-M图像分类中,在半监督的图像分类中对5个公共数据集进行了广泛的实验。