Benign Overfitting in Classification: Provably Counter Label Noise with Larger Models

Kaiyue Wen, Jiaye Teng, Jingzhao Zhang

From arXiv. Published as a conference paper at ICLR 2023.

Studies on benign overfitting provide insights for the success of overparameterized deep learning models. In this work, we examine whether overfitting is truly benign in real-world classification tasks. We start with the observation that a ResNet model overfits benignly on Cifar10 but not benignly on ImageNet. To understand why benign overfitting fails in the ImageNet experiment, we theoretically analyze benign overfitting under a more restrictive setup where the number of parameters is not significantly larger than the number of data points. Under this mild overparameterization setup, our analysis identifies a phase change: unlike in the previous heavy overparameterization settings, benign overfitting can now fail in the presence of label noise. Our analysis explains our empirical observations, and is validated by a set of control experiments with ResNets. Our work highlights the importance of understanding implicit bias in underfitting regimes as a future direction.
