As the size of the dataset used in deep learning tasks increases, the noisy label problem, which is a task of making deep learning robust to the incorrectly labeled data, has become an important task. In this paper, we propose a method of learning noisy label data using the label noise selection with test-time augmentation (TTA) cross-entropy and classifier learning with the NoiseMix method. In the label noise selection, we propose TTA cross-entropy by measuring the cross-entropy to predict the test-time augmented training data. In the classifier learning, we propose the NoiseMix method based on MixUp and BalancedMix methods by mixing the samples from the noisy and the clean label data. In experiments on the ISIC-18 public skin lesion diagnosis dataset, the proposed TTA cross-entropy outperformed the conventional cross-entropy and the TTA uncertainty in detecting label noise data in the label noise selection process. Moreover, the proposed NoiseMix not only outperformed the state-of-the-art methods in the classification performance but also showed the most robustness to the label noise in the classifier learning.
翻译:随着深层学习任务中所用数据集的大小的增大,作为使深层学习对错误标签数据进行强力强化的任务,响亮标签问题已成为一项重要任务。在本文件中,我们建议采用一种方法,利用测试-时间增强(TTA)跨渗透性激素和分类学方法来学习标签的噪声选择;在标签噪声选择中,我们建议通过测量跨性激素来预测测试-时间增强的培训数据,从而进行跨性激素。在分类学习中,我们建议采用基于混合和平衡混合方法,将杂音和清洁标签数据的样本混合起来。在ISIC-18公众皮肤损伤诊断数据集的实验中,拟议的TTTA跨性激素超越了传统的跨性激素和TTA在探测标签噪音数据时的不确定性。此外,拟议的NoiseMix不仅超越了分类性能中的状态-艺术方法,而且还展示了在分类学习的噪声中最牢固的标签。