Uncertainty quantification for deep neural networks has recently evolved through a variety of techniques. In this work, we revisit the Laplace approximation, a classical and computationally attractive approach to posterior approximation. Instead of computing the curvature matrix, we show that, under some regularity conditions, the Laplace approximation can be constructed directly from the gradient second moment. This quantity is already estimated by exponential-moving-average variants of Adagrad, such as Adam and RMSprop, but is traditionally discarded after training. We show that our method (L2M) requires no changes to models or optimization, can be implemented in a few lines of code to yield reasonable results, needs no computation beyond what optimizers already perform, and introduces no new hyperparameters. We hope our method opens new research directions on using quantities already computed by optimizers for uncertainty estimation in deep neural networks.
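To make the idea concrete, below is a minimal PyTorch sketch of the general recipe described above: after training with Adam, read out the exponential-moving-average second-moment estimates (exp_avg_sq) from the optimizer state and use them as a diagonal curvature proxy for a Gaussian (Laplace) posterior over the parameters. The scaling by the dataset size and the prior_precision term are illustrative assumptions, not the paper's exact formulation.

```python
import torch

def laplace_posterior_from_adam(optimizer, n_data, prior_precision=1.0):
    """Build a diagonal Gaussian posterior over parameters from Adam's
    second-moment estimates, which are otherwise discarded after training.
    The precision formula below is an illustrative assumption."""
    means, variances = [], []
    for group in optimizer.param_groups:
        for p in group["params"]:
            state = optimizer.state.get(p, {})
            if "exp_avg_sq" not in state:
                continue
            # exp_avg_sq is Adam's EMA estimate of the per-parameter
            # gradient second moment E[g^2]; use it as a curvature proxy.
            precision = n_data * state["exp_avg_sq"] + prior_precision
            means.append(p.detach().clone())
            variances.append(1.0 / precision)
    return means, variances

def sample_parameters(means, variances):
    """Draw one sample of the parameters from the diagonal Gaussian posterior."""
    return [m + v.sqrt() * torch.randn_like(m) for m, v in zip(means, variances)]
```

In use, one would load each sampled tensor back into the corresponding model parameter and average the resulting predictions over several samples to obtain predictive uncertainty; no extra passes over the data are needed beyond the original training run.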