Clustering-based methods, which alternate between the generation of pseudo labels and the optimization of the feature extraction network, play a dominant role in both unsupervised learning (USL) and unsupervised domain adaptive (UDA) person re-identification (Re-ID). To alleviate the adverse effect of noisy pseudo labels, the existing methods either abandon unreliable labels or refine the pseudo labels via mutual learning or label propagation. However, a great many erroneous labels are still accumulated because these methods mostly adopt traditional unsupervised clustering algorithms which rely on certain assumptions on data distribution and fail to capture the distribution of complex real-world data. In this paper, we propose the plug-and-play graph-based pseudo label correction network (GLC) to refine the pseudo labels in the manner of supervised clustering. GLC is trained to perceive the varying data distribution at each epoch of the self-training with the supervision of initial pseudo labels generated by any clustering method. It can learn to rectify the initial noisy labels by means of the relationship constraints between samples on the k Nearest Neighbor (kNN) graph and early-stop training strategy. Specifically, GLC learns to aggregate node features from neighbors and predict whether the nodes should be linked on the graph. Besides, GLC is optimized with 'early stop' before the noisy labels are severely memorized to prevent overfitting to noisy pseudo labels. Consequently, GLC improves the quality of pseudo labels though the supervision signals contain some noise, leading to better Re-ID performance. Extensive experiments in USL and UDA person Re-ID on Market-1501 and MSMT17 show that our method is widely compatible with various clustering-based methods and promotes the state-of-the-art performance consistently.
翻译:在生成假标签和优化地物提取网络之间互换的基于集群的方法在生成伪标签和优化地物提取网络之间,在未经监督的学习(USL)和未经监督的域图化(UDA)个人重新识别(Re-ID)中发挥着主导作用。为减轻杂音伪标签的不利影响,现有方法要么放弃不可靠的标签或通过相互学习或标签传播来改进伪标签。然而,大量错误标签仍在积累,因为这些方法大多采用依赖某些数据分配假设和无法获取复杂真实世界数据的分布的传统、不受监督的分组算法。在本文件中,我们建议基于插座和剧本图的伪造标签修正(UDA)网络(GLC)来改进假标签的负面效果。GLC在自我培训过程中,通过监督任何组合方法生成的初始假标签,发现不同的数据分布不尽相同。通过在接近Neigh Nerighbbbbor(KNNNE)的标本和早期测试策略下,通过GLC(G-LC)来持续地、从G-D(Oral-LD)和Oral-M(G-M(O)的模拟)中,在G-LD(G-LD)中学习更精确地和最精确地和最精确地)的模拟的模拟的模拟的模拟的模拟的模拟的模拟的模拟的运行,应该从G-LC-LC-LC-LC-LC-LD)和最精确的模拟战略。