放松地物共变假设: 线性倒退中优度调整的时间- 变形宽度 (Relaxing the Feature Covariance Assumption: Time-Variant Bounds for Benign Overfitting in Linear Regression)

Benign overfitting demonstrates that overparameterized models can perform well on test data while fitting noisy training data. However, it only considers the final min-norm solution in linear regression, which ignores the algorithm information and the corresponding training procedure. In this paper, we generalize the idea of benign overfitting to the whole training trajectory instead of the min-norm solution and derive a time-variant bound based on the trajectory analysis. Starting from the time-variant bound, we further derive a time interval that suffices to guarantee a consistent generalization error for a given feature covariance. Unlike existing approaches, the newly proposed generalization bound is characterized by a time-variant effective dimension of feature covariance. By introducing the time factor, we relax the strict assumption on the feature covariance matrix required in previous benign overfitting under the regimes of overparameterized linear regression with gradient descent. This paper extends the scope of benign overfitting, and experiment results indicate that the proposed bound accords better with empirical evidence.

翻译：过度调整表明,超度模型在安装噪音培训数据的同时,在测试数据上能够很好地发挥作用;然而,它只考虑线性回归的最后中中下方溶液,它忽略了算法信息和相应的培训程序;在本文中,我们概括了将稳妥地过度适应整个培训轨迹而不是中下方溶液的想法,并根据轨迹分析得出一个时间差的界限。从时间差的界限开始,我们进一步得出一个时间间隔,足以保证对特定特征的共变性有一个一致的普遍化错误。与现有方法不同,新提出的一般化约束的特点是特征共变化具有时间差的有效维度。通过引入时间因素,我们放松了先前的稳妥地过度适应参数的特征共变矩阵的严格假设,即:在梯度梯度下降的超度线回归制度下,我们放松了先前的严格假设。本文扩大了良性调整范围,扩大了稳妥地过度调整范围,实验结果表明,拟议的约束与经验证据更一致。

相关内容

过拟合

关注 8

过拟合，在AI领域多指机器学习得到模型太过复杂，导致在训练集上表现很好，然而在测试集上却不尽人意。过拟合（over-fitting）也称为过学习，它的直观表现是算法在训练集上表现好，但在测试集上表现不好，泛化性能差。过拟合是在模型参数拟合过程中由于训练数据包含抽样误差，在训练时复杂的模型将抽样误差也进行了拟合导致的。

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

54+阅读 · 2020年9月7日

【UIUC硬核书】统计学习理论，Statistical Learning Theory，213页pdf

专知会员服务

134+阅读 · 2020年4月14日