We provide sharp path-dependent generalization and excess error guarantees for the full-batch Gradient Descent (GD) algorithm on smooth losses (possibly non-Lipschitz, possibly nonconvex). At the heart of our analysis is a novel generalization-error technique for deterministic symmetric algorithms, which implies that average output stability, together with a bounded expected gradient of the loss at termination, leads to generalization. This key result shows that small generalization error arises at stationary points, and it allows us to bypass the Lipschitz assumptions on the loss that are prevalent in previous work. For nonconvex, convex, and strongly convex losses, we show the explicit dependence of the generalization error on the accumulated path-dependent optimization error, the terminal optimization error, the number of samples, and the number of iterations. For nonconvex smooth losses, we prove that full-batch GD efficiently generalizes close to any stationary point at termination, under a properly chosen decreasing step size. Further, if the loss is nonconvex but the objective satisfies the Polyak-Łojasiewicz (PL) condition, we derive vanishing bounds on the corresponding excess risk. For convex and strongly convex smooth losses, we prove that full-batch GD generalizes even with large constant step sizes and achieves small excess risk while training fast. Our generalization-error and excess-risk bounds for full-batch GD are significantly tighter than existing bounds for (stochastic) GD when the loss is smooth (but possibly non-Lipschitz).
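For concreteness, the full-batch GD iteration analyzed here is the standard one; the sketch below uses generic placeholder notation ($\ell$, $\hat{R}_S$, $R$, $w_t$, $\eta_t$), not necessarily the paper's own symbols:
\[
\hat{R}_S(w) \;=\; \frac{1}{n}\sum_{i=1}^{n}\ell(w; z_i),
\qquad
w_{t+1} \;=\; w_t - \eta_t \,\nabla \hat{R}_S(w_t),
\]
and the generalization error of the terminal iterate $w_T$ is $\mathbb{E}\big[\,R(w_T) - \hat{R}_S(w_T)\,\big]$, where $R(w) = \mathbb{E}_{z}\big[\ell(w; z)\big]$ denotes the population risk.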