Noisy labels are inevitable in large real-world datasets. In this work, we explore an area understudied by previous works -- how the network's architecture impacts its robustness to noisy labels. We provide a formal framework connecting the robustness of a network to the alignments between its architecture and target/noise functions. Our framework measures a network's robustness via the predictive power in its representations -- the test performance of a linear model trained on the learned representations using a small set of clean labels. We hypothesize that a network is more robust to noisy labels if its architecture is more aligned with the target function than the noise. To support our hypothesis, we provide both theoretical and empirical evidence across various neural network architectures and different domains. We also find that when the network is well-aligned with the target function, its predictive power in representations could improve upon state-of-the-art (SOTA) noisy-label-training methods in terms of test accuracy and even outperform sophisticated methods that use clean labels.
翻译:在大型真实世界数据集中,噪音标签是不可避免的。在这项工作中,我们探索了一个未受先前工作研究的领域 -- -- 网络的架构如何影响其稳健性对吵闹标签的影响。我们提供了一个正式的框架,将网络的稳健性与其结构与目标/噪音功能之间的对齐联系起来。我们的框架测量了网络的稳健性,其表现方式是预测力 -- -- 使用少量清洁标签进行学习演示的线性模型的测试性能。我们假设,如果网络的架构比噪音更符合目标功能,则网络对吵闹标签更强大。为了支持我们的假设,我们提供了各种神经网络架构和不同领域的理论和经验证据。我们还发现,当网络与目标功能完全吻合时,其动态中的预测力可以在测试准确性甚至使用清洁标签的超常规方法方面改进最先进的标准(SOTA)噪声标签培训方法。