Discriminative Bayesian filtering lends momentum to the stochastic Newton method for minimizing log-convex functions


Momentum · Discriminator · Optimizer · Subsampling · Functional

June 22, 2022


Michael C. Burkhart

From arXiv; to appear in Optimization Letters (2022)

To minimize the average of a set of log-convex functions, the stochastic Newton method iteratively updates its estimate using subsampled versions of the full objective's gradient and Hessian. We contextualize this optimization problem as sequential Bayesian inference on a latent state-space model with a discriminatively-specified observation process. Applying Bayesian filtering then yields a novel optimization algorithm that considers the entire history of gradients and Hessians when forming an update. We establish matrix-based conditions under which the effect of older observations diminishes over time, in a manner analogous to Polyak's heavy ball momentum. We illustrate various aspects of our approach with an example and review other relevant innovations for the stochastic Newton method.
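As a rough illustration of the baseline that the abstract starts from, the following is a minimal sketch of a plain subsampled (stochastic) Newton iteration. It is not the filtering-based algorithm proposed in the paper. The test functions f_i(x) = exp(a_i·x) + exp(-a_i·x), the function names, the batch size, and the iteration count are all assumptions chosen for illustration; each f_i is a sum of log-convex functions and is therefore log-convex, and their average is minimized at x = 0.

```python
import numpy as np


def objective(A, x):
    """Full objective: average of f_i(x) = exp(a_i.x) + exp(-a_i.x) = 2*cosh(a_i.x)."""
    return np.mean(2.0 * np.cosh(A @ x))


def batch_grad_hess(A_batch, x):
    """Gradient and Hessian of the objective restricted to a subsample of rows."""
    s = A_batch @ x
    m = len(s)
    grad = (2.0 * np.sinh(s)) @ A_batch / m                # shape (d,)
    hess = (A_batch.T * (2.0 * np.cosh(s))) @ A_batch / m  # shape (d, d)
    return grad, hess


def stochastic_newton(A, x0, batch_size=32, n_iters=30, seed=0):
    """Plain subsampled Newton: each update uses only the current batch."""
    rng = np.random.default_rng(seed)
    x = np.array(x0, dtype=float)
    for _ in range(n_iters):
        idx = rng.choice(len(A), size=batch_size, replace=False)
        g, H = batch_grad_hess(A[idx], x)
        # Newton step: solve H p = g rather than forming the inverse.
        x = x - np.linalg.solve(H, g)
    return x


# Hypothetical data: 200 random directions a_i in R^3.
data_rng = np.random.default_rng(42)
A = data_rng.standard_normal((200, 3))
x0 = np.full(3, 0.5)
x_final = stochastic_newton(A, x0)
print(objective(A, x0), objective(A, x_final))
```

Each update here discards everything but the current batch. The abstract's contribution is precisely to replace this memoryless step with one that weighs the whole history of subsampled gradients and Hessians.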


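Related content

The momentum method (Polyak, 1964) aims to accelerate learning, particularly in the presence of high curvature, small but consistent gradients, or noisy gradients. The momentum algorithm accumulates an exponentially decaying moving average of past gradients and continues to move in that direction.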
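The accumulation of past gradients described above can be sketched as follows. This is a minimal heavy-ball example on an assumed ill-conditioned quadratic; the matrix `Q`, the step size, the decay factor `beta`, and the function names are illustrative choices rather than anything specified by the source.

```python
import numpy as np

# Assumed test problem: f(x) = 0.5 * x^T Q x with curvatures 1 and 25.
Q = np.diag([1.0, 25.0])


def grad(x):
    """Gradient of the quadratic test function."""
    return Q @ x


def heavy_ball(x0, lr=0.03, beta=0.9, n_iters=400):
    """Gradient descent with Polyak-style momentum."""
    x = np.array(x0, dtype=float)
    v = np.zeros_like(x)
    for _ in range(n_iters):
        # The velocity is an exponentially decaying sum of past gradient steps.
        v = beta * v - lr * grad(x)
        x = x + v
    return x


def gradient_descent(x0, lr=0.03, n_iters=400):
    """Plain gradient descent with the same step size, for comparison."""
    x = np.array(x0, dtype=float)
    for _ in range(n_iters):
        x = x - lr * grad(x)
    return x


x_start = np.array([1.0, 1.0])
x_momentum = heavy_ball(x_start)
x_plain = gradient_descent(x_start)
print(np.linalg.norm(x_momentum), np.linalg.norm(x_plain))
```

With `beta = 0`, `heavy_ball` reduces to `gradient_descent`. A larger `beta` lets older gradients influence the update for longer, which is the behavior the abstract compares its filtering-based update to.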

