Convolutional Neural Networks (CNNs) have demonstrated superiority in learning patterns, but are sensitive to label noises and may overfit noisy labels during training. The early stopping strategy averts updating CNNs during the early training phase and is widely employed in the presence of noisy labels. Motivated by biological findings that the amplitude spectrum (AS) and phase spectrum (PS) in the frequency domain play different roles in the animal's vision system, we observe that PS, which captures more semantic information, can increase the robustness of DNNs to label noise, more so than AS can. We thus propose early stops at different times for AS and PS by disentangling the features of some layer(s) into AS and PS using Discrete Fourier Transform (DFT) during training. Our proposed Phase-AmplituDe DisentangLed Early Stopping (PADDLES) method is shown to be effective on both synthetic and real-world label-noise datasets. PADDLES outperforms other early stopping methods and obtains state-of-the-art performance.
翻译:进化神经网络(CNNs)在学习模式方面表现出优势,但对标签噪音敏感,在培训期间可能过度贴上噪音标签。早期停止战略避免了在早期培训阶段更新CNN, 并在有噪音标签的情况下被广泛使用。由于生物发现,频率域中的振幅频谱和相频频谱在动物的视觉系统中起着不同的作用,我们观察到,捕捉更多语义信息的PS能够提高DNS对标签噪音的坚固性,比AS更强。因此,我们建议在不同的时间尽早停止对AS和PS的干扰,方法是在培训期间使用分辨的四面形变形(DFT)将某些层(s)的特征切换到AS和PS。我们提议的PADDLES(PADDLES)方法对合成和真实世界标签-噪音数据集都有效。PDDLES超越了其他早期停止方法,并获得了状态性能。