Langevin 扩散第一命令分解的高级命令一般化错误 (Higher Order Generalization Error for First Order Discretization of Langevin Diffusion) - 专知论文

会员服务 ·

0

泛化误差 · 泛化理论 · 离散化 · 平滑 · 损失函数（机器学习） ·

2021 年 2 月 11 日

Higher Order Generalization Error for First Order Discretization of Langevin Diffusion

翻译：Langevin 扩散第一命令分解的高级命令一般化错误

Mufan Bill Li,Maxime Gazeau

We propose a novel approach to analyze generalization error for discretizations of Langevin diffusion, such as the stochastic gradient Langevin dynamics (SGLD). For an $\epsilon$ tolerance of expected generalization error, it is known that a first order discretization can reach this target if we run $\Omega(\epsilon^{-1} \log (\epsilon^{-1}) )$ iterations with $\Omega(\epsilon^{-1})$ samples. In this article, we show that with additional smoothness assumptions, even first order methods can achieve arbitrarily runtime complexity. More precisely, for each $N>0$, we provide a sufficient smoothness condition on the loss function such that a first order discretization can reach $\epsilon$ expected generalization error given $\Omega( \epsilon^{-1/N} \log (\epsilon^{-1}) )$ iterations with $\Omega(\epsilon^{-1})$ samples.

翻译：我们提出一种新的方法来分析朗埃文扩散的离散性差错,例如Stochistic 梯度Langevin动态(SGLD)等。对于预期普遍化差错的容度,我们知道,如果我们用$Omega(\epsilon ⁇ -1}\log(\epsilon ⁇ -1})来运行以$Omega(\epsilon ⁇ -1})为样本的折叠性差错,则第一级离散性就能够达到这个目标。在文章中,我们表明,如果增加顺畅性假设,即使第一级方法也能实现任意运行时间的复杂性。更确切地说,对于每1美元,我们为损失函数提供了足够的顺畅性条件,以便第一级离异性能达到美元预期的普遍差错,给$Omega(\epsilon ⁇ -1}(\epsilon ⁇ -1}(\\\ ipsilon ⁇ -1})。

0

相关内容

泛化误差

学习方法的泛化能力（Generalization Error）是由该方法学习到的模型对未知数据的预测能力，是学习方法本质上重要的性质。现实中采用最多的办法是通过测试泛化误差来评价学习方法的泛化能力。泛化误差界刻画了学习算法的经验风险与期望风险之间偏差和收敛速度。一个机器学习的泛化误差（Generalization Error），是一个描述学生机器在从样品数据中学习之后，离教师机器之间的差距的函数。

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【UIUC】最新《自监督学习》教程，51页ppt，Self-supervised learning

【UIUC】最新《自监督学习》教程，51页ppt，Self-supervised learning

专知会员服务

84+阅读 · 2020年11月25日

【EMNLP2020】序列知识蒸馏进展，44页ppt

【EMNLP2020】序列知识蒸馏进展，44页ppt

专知会员服务

39+阅读 · 2020年11月21日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

图解FixMatch的半监督学习，The Illustrated FixMatch for Semi-Supervised Learning

图解FixMatch的半监督学习，The Illustrated FixMatch for Semi-Supervised Learning

专知会员服务

26+阅读 · 2020年4月2日

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

专知会员服务

54+阅读 · 2020年3月5日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

【AAAI2020论文】小样本网络压缩，Few Shot Network Compression via Cross Distillation (附pdf）

专知会员服务

26+阅读 · 2019年11月23日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

已删除

将门创投

8+阅读 · 2019年3月18日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

Prophet Inequalities for I.I.D. Random Variables from an Unknown Distribution

Arxiv

0+阅读 · 2021年4月7日

Computing the Characteristic Polynomial of Generic Toeplitz-like and Hankel-like Matrices

Arxiv

0+阅读 · 2021年4月6日

Multi-Robot Pickup and Delivery via Distributed Resource Allocation

Arxiv

0+阅读 · 2021年4月6日

Discontinuous Galerkin method for blow-up solutions of nonlinear 1D wave equations

Arxiv

0+阅读 · 2021年4月5日

Self-Healing First-Order Distributed Optimization

Self-Healing First-Order Distributed Optimization

Arxiv

0+阅读 · 2021年4月5日

Multilevel Stein variational gradient descent with applications to Bayesian inverse problems

Arxiv

0+阅读 · 2021年4月5日

Cluster-based Distributed Augmented Lagrangian Algorithm for a Class of Constrained Convex Optimization Problems

Arxiv

0+阅读 · 2021年4月2日

A remark on discretization of the uniform norm

Arxiv

0+阅读 · 2021年4月2日

Factorized Graph Representations for Semi-Supervised Learning from Sparse Data

Arxiv

4+阅读 · 2020年3月5日

Coulomb GANs: Provably Optimal Nash Equilibria via Potential Fields

Arxiv

4+阅读 · 2018年1月30日

VIP会员

文章信息

相关主题

损失函数（机器学习）

相关VIP内容

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【UIUC】最新《自监督学习》教程，51页ppt，Self-supervised learning

【UIUC】最新《自监督学习》教程，51页ppt，Self-supervised learning

专知会员服务

84+阅读 · 2020年11月25日

【EMNLP2020】序列知识蒸馏进展，44页ppt

【EMNLP2020】序列知识蒸馏进展，44页ppt

专知会员服务

39+阅读 · 2020年11月21日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

图解FixMatch的半监督学习，The Illustrated FixMatch for Semi-Supervised Learning

图解FixMatch的半监督学习，The Illustrated FixMatch for Semi-Supervised Learning

专知会员服务

26+阅读 · 2020年4月2日

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

专知会员服务

54+阅读 · 2020年3月5日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

【AAAI2020论文】小样本网络压缩，Few Shot Network Compression via Cross Distillation (附pdf）

专知会员服务

26+阅读 · 2019年11月23日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

从社会学实验到行为仿真：理解基于Agent的观点动力学建模思维

中英文版《GPT-5 System Card速览》报告

ACL 2025 | 大模型结构化知识提示的泛化能力研究

【普林斯顿博士论文】大型模型的高效推理

相关资讯

已删除

将门创投

8+阅读 · 2019年3月18日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

相关论文

Prophet Inequalities for I.I.D. Random Variables from an Unknown Distribution

Arxiv

0+阅读 · 2021年4月7日

Computing the Characteristic Polynomial of Generic Toeplitz-like and Hankel-like Matrices

Arxiv

0+阅读 · 2021年4月6日

Multi-Robot Pickup and Delivery via Distributed Resource Allocation

Arxiv

0+阅读 · 2021年4月6日

Discontinuous Galerkin method for blow-up solutions of nonlinear 1D wave equations

Arxiv

0+阅读 · 2021年4月5日

Self-Healing First-Order Distributed Optimization

Self-Healing First-Order Distributed Optimization

Arxiv

0+阅读 · 2021年4月5日

Multilevel Stein variational gradient descent with applications to Bayesian inverse problems

Arxiv

0+阅读 · 2021年4月5日

Cluster-based Distributed Augmented Lagrangian Algorithm for a Class of Constrained Convex Optimization Problems

Arxiv

0+阅读 · 2021年4月2日

A remark on discretization of the uniform norm

Arxiv

0+阅读 · 2021年4月2日

Factorized Graph Representations for Semi-Supervised Learning from Sparse Data

Arxiv

4+阅读 · 2020年3月5日

Coulomb GANs: Provably Optimal Nash Equilibria via Potential Fields

Arxiv

4+阅读 · 2018年1月30日

微信扫码咨询专知VIP会员