In this paper, we address the problem of training deep neural networks in the presence of severe label noise. Our proposed training algorithm ScanMix, combines semantic clustering with semi-supervised learning (SSL) to improve the feature representations and enable an accurate identification of noisy samples, even in severe label noise scenarios. To be specific, ScanMix is designed based on the expectation maximisation (EM) framework, where the E-step estimates the value of a latent variable to cluster the training images based on their appearance representations and classification results, and the M-step optimises the SSL classification and learns effective feature representations via semantic clustering. In our evaluations, we show state-of-the-art results on standard benchmarks for symmetric, asymmetric and semantic label noise on CIFAR-10 and CIFAR-100, as well as large scale real label noise on WebVision. Most notably, for the benchmarks contaminated with large noise rates (80% and above), our results are up to 27% better than the related work. The code is available at https://github.com/ragavsachdeva/ScanMix.
翻译:在本文中,我们讨论了在有严重标签噪音的情况下培训深神经网络的问题。我们建议的培训算法ScanMix,将语义组合与半监督学习相结合,以改进特征表现,并能够准确识别噪音样品,即使在严格的标签噪音情况下也是如此。具体地说,ScanMix的设计基于预期最大化框架,E-步骤估计了一个潜在变量的价值,以基于其外观表现和分类结果对培训图像进行分组,M-步骤优化SSL分类,并通过语义组合学习有效特征表现。我们的评估显示,在CIRA-10和CIFAR-100上,以及在网络Vision上大规模真实标签噪音的标准基准标准标准标准标准标准标准标准标准上,最显著的是,对于受到大噪音率污染的基准(80%以上),我们的结果比相关工作高出27%。代码见https://github.comragavachdeva/ScanMix。