Benign overfitting demonstrates that overparameterized models can perform well on test data while fitting noisy training data. However, it only considers the final min-norm solution in linear regression, which ignores the algorithm information and the corresponding training procedure. In this paper, we generalize the idea of benign overfitting to the whole training trajectory instead of the min-norm solution and derive a time-variant bound based on the trajectory analysis. Starting from the time-variant bound, we further derive a time interval that suffices to guarantee a consistent generalization error for a given feature covariance. Unlike existing approaches, the newly proposed generalization bound is characterized by a time-variant effective dimension of feature covariance. By introducing the time factor, we relax the strict assumption on the feature covariance matrix required in previous benign overfitting under the regimes of overparameterized linear regression with gradient descent. This paper extends the scope of benign overfitting, and experiment results indicate that the proposed bound accords better with empirical evidence.
翻译:过度调整表明,超度模型在安装噪音培训数据的同时,在测试数据上能够很好地发挥作用;然而,它只考虑线性回归的最后中中下方溶液,它忽略了算法信息和相应的培训程序;在本文中,我们概括了将稳妥地过度适应整个培训轨迹而不是中下方溶液的想法,并根据轨迹分析得出一个时间差的界限。从时间差的界限开始,我们进一步得出一个时间间隔,足以保证对特定特征的共变性有一个一致的普遍化错误。与现有方法不同,新提出的一般化约束的特点是特征共变化具有时间差的有效维度。通过引入时间因素,我们放松了先前的稳妥地过度适应参数的特征共变矩阵的严格假设,即:在梯度梯度下降的超度线回归制度下,我们放松了先前的严格假设。本文扩大了良性调整范围,扩大了稳妥地过度调整范围,实验结果表明,拟议的约束与经验证据更一致。