
Generalization theory · Uniform stability · UniFormer · Better · Data points

July 13, 2021

Toward Better Generalization Bounds with Locally Elastic Stability


Zhun Deng, Hangfeng He, Weijie J. Su

From arXiv; published in ICML 2021

Algorithmic stability is a key characteristic for ensuring the generalization ability of a learning algorithm. Among the different notions of stability, uniform stability is arguably the most popular, and it yields exponential generalization bounds. However, uniform stability considers only the worst-case change in loss (the so-called sensitivity) caused by removing a single data point; this notion is distribution-independent and therefore undesirable. In many cases the worst-case sensitivity of the loss is much larger than the average sensitivity taken over the single data point that is removed, especially in advanced models such as random feature models or neural networks. Many previous works try to mitigate the distribution-independence issue by proposing weaker notions of stability, but they either yield only polynomial bounds or give bounds that do not vanish as the sample size goes to infinity. Given that, we propose locally elastic stability, a weaker and distribution-dependent notion of stability that still yields exponential generalization bounds. We further demonstrate that locally elastic stability implies tighter generalization bounds than those derived from uniform stability in many situations by revisiting the examples of bounded support vector machines, regularized least squares regression, and stochastic gradient descent.
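The gap between worst-case and average sensitivity mentioned in the abstract can be seen on a toy problem. The sketch below is only an illustration, not the paper's method or its definition of locally elastic stability: it fits a one-dimensional ridge regression in closed form, removes each training point in turn, and records how much the loss at one fixed test point changes. The function names, the synthetic data, the regularization strength `lam=1.0`, and the test point are all assumptions made for this example. Note that uniform stability takes a supremum over all datasets and test points, whereas this sketch only maximizes over the removed index for a single sample.

```python
import random


def fit_ridge_1d(points, lam):
    """Closed-form 1-D ridge regression without intercept.

    Minimizes sum((w*x - y)^2) + lam * w^2, giving w = sum(x*y) / (sum(x*x) + lam).
    """
    sxy = sum(x * y for x, y in points)
    sxx = sum(x * x for x, _ in points)
    return sxy / (sxx + lam)


def squared_loss(w, point):
    """Squared error of the linear predictor w*x on a single (x, y) pair."""
    x, y = point
    return (w * x - y) ** 2


def loo_sensitivities(train, test_point, lam):
    """Absolute change in test loss when each training point is removed in turn."""
    base = squared_loss(fit_ridge_1d(train, lam), test_point)
    changes = []
    for i in range(len(train)):
        reduced = train[:i] + train[i + 1:]  # training set without point i
        w_loo = fit_ridge_1d(reduced, lam)
        changes.append(abs(base - squared_loss(w_loo, test_point)))
    return changes


# Hypothetical synthetic data: y = 2x plus small Gaussian noise.
rng = random.Random(0)
train = []
for _ in range(50):
    x = rng.uniform(-1.0, 1.0)
    train.append((x, 2.0 * x + rng.gauss(0.0, 0.1)))

test_point = (0.5, 1.0)  # assumed test example
sens = loo_sensitivities(train, test_point, lam=1.0)
worst_case = max(sens)
average = sum(sens) / len(sens)
print(f"worst-case sensitivity: {worst_case:.6f}")
print(f"average sensitivity:    {average:.6f}")
```

The maximum is never smaller than the mean, and the ratio between the two gives a rough sense of how pessimistic a worst-case quantity can be relative to a typical removed point on this particular sample.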


