隐含SGD的统计推断值:近似Robbins-Monro对Polyak-Ruppert (Statistical inference with implicit SGD: proximal Robbins-Monro vs. Polyak-Ruppert) - 专知论文

会员服务 ·

0

估计/估计量 · SGD · Analysis · 统计量 · 推断 ·

2022 年 6 月 25 日

Statistical inference with implicit SGD: proximal Robbins-Monro vs. Polyak-Ruppert

翻译：隐含SGD的统计推断值:近似Robbins-Monro对Polyak-Ruppert

Yoonhyung Lee,Sungdong Lee,Joong-Ho Won

from arxiv, Accepted to the 39 th International Conference on Machine Learning

The implicit stochastic gradient descent (ISGD), a proximal version of SGD, is gaining interest in the literature due to its stability over (explicit) SGD. In this paper, we conduct an in-depth analysis of the two modes of ISGD for smooth convex functions, namely proximal Robbins-Monro (proxRM) and proximal Poylak-Ruppert (proxPR) procedures, for their use in statistical inference on model parameters. Specifically, we derive non-asymptotic point estimation error bounds of both proxRM and proxPR iterates and their limiting distributions, and propose on-line estimators of their asymptotic covariance matrices that require only a single run of ISGD. The latter estimators are used to construct valid confidence intervals for the model parameters. Our analysis is free of the generalized linear model assumption that has limited the preceding analyses, and employs feasible procedures. Our on-line covariance matrix estimators appear to be the first of this kind in the ISGD literature.

翻译：隐含的测深梯度下沉(ISGD)是SGD的近似版本,对文献越来越感兴趣,因为它在(明确的)SGD上具有稳定性。在本文件中,我们对ISGD的两种模式进行深入分析,即光滑的二次曲线函数模式,即准Robbins-Monro(ProxRM)和准Polylak-Ruppert(proxPR)程序,用于模型参数的统计推断。具体地说,我们得出了ProxRM和ProxPRERExerates及其限制分布的非抽点估计误差界限,并提议在线估计其单次运行ISGD的无症状共变矩阵。后者用于为模型参数构建有效的信任间隔。我们的分析没有使用限制先前分析的通用线性模型假设,而是采用可行的程序。我们的在线变量矩阵估计器似乎是ISGD文献中这类模型的第一个。

0

相关内容

估计/估计量

估计/估计量

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

专知

17+阅读 · 2018年4月28日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

活性氧调控植物根生长的分子机理

国家自然科学基金

0+阅读 · 2015年12月31日

PPARγ外源激动剂早期干预NNK诱导肺癌干细胞中15-LOX表达调控的机制

国家自然科学基金

0+阅读 · 2014年12月31日

大规模分布式系统中服务失效的自动诊断方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于SURE/PURE准则的图像盲反卷积算法研究

国家自然科学基金

3+阅读 · 2013年12月31日

羊八井观测站大气不透明度的测量

国家自然科学基金

0+阅读 · 2013年12月31日

高光谱遥感图像解混的稀疏性正则化方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

TRAF1在心肌梗死后心室重构中的作用及分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

基于复值ICA和张量分解的完备fMRI数据分析方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于信道Time/Power度量指标的TOA测距误差模型及其应用研究

国家自然科学基金

0+阅读 · 2011年12月31日

多天线OFDM信道全信息压缩估计理论与方法

国家自然科学基金

0+阅读 · 2011年12月31日

AdaBin: Improving Binary Neural Networks with Adaptive Binary Sets

Arxiv

0+阅读 · 2022年8月17日

Reweighting the RCT for generalization: finite sample analysis and variable selection

Arxiv

0+阅读 · 2022年8月16日

Convergence Rates for Stochastic Approximation on a Boundary

Arxiv

0+阅读 · 2022年8月15日

Doubly Robust Estimation under Covariate-induced Dependent Left Truncation

Arxiv

0+阅读 · 2022年8月14日

Optimal Recovery for Causal Inference

Arxiv

0+阅读 · 2022年8月13日

Approximate Post-Selective Inference for Regression with the Group LASSO

Arxiv

0+阅读 · 2022年8月13日

Causal Discovery in Probabilistic Networks with an Identifiable Causal Effect

Arxiv

0+阅读 · 2022年8月13日

Variational Quantum Approximate Support Vector Machine With Inference Transfer

Arxiv

0+阅读 · 2022年8月12日

Machine learning in front of statistical methods for prediction spread SARS-CoV-2 in Colombia

Machine learning in front of statistical methods for prediction spread SARS-CoV-2 in Colombia

Arxiv

0+阅读 · 2022年8月12日

An Accelerated Doubly Stochastic Gradient Method with Faster Explicit Model Identification

Arxiv

0+阅读 · 2022年8月11日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【牛津博士论文】面向视觉、物理与语言应用的可信机器学习模型

医学领域大型语言模型的新进展

战场AI决策支持系统

【NeurIPS 2025】视觉指令瓶颈微调

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

专知

17+阅读 · 2018年4月28日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

AdaBin: Improving Binary Neural Networks with Adaptive Binary Sets

Arxiv

0+阅读 · 2022年8月17日

Reweighting the RCT for generalization: finite sample analysis and variable selection

Arxiv

0+阅读 · 2022年8月16日

Convergence Rates for Stochastic Approximation on a Boundary

Arxiv

0+阅读 · 2022年8月15日

Doubly Robust Estimation under Covariate-induced Dependent Left Truncation

Arxiv

0+阅读 · 2022年8月14日

Optimal Recovery for Causal Inference

Arxiv

0+阅读 · 2022年8月13日

Approximate Post-Selective Inference for Regression with the Group LASSO

Arxiv

0+阅读 · 2022年8月13日

Causal Discovery in Probabilistic Networks with an Identifiable Causal Effect

Arxiv

0+阅读 · 2022年8月13日

Variational Quantum Approximate Support Vector Machine With Inference Transfer

Arxiv

0+阅读 · 2022年8月12日

Machine learning in front of statistical methods for prediction spread SARS-CoV-2 in Colombia

Machine learning in front of statistical methods for prediction spread SARS-CoV-2 in Colombia

Arxiv

0+阅读 · 2022年8月12日

An Accelerated Doubly Stochastic Gradient Method with Faster Explicit Model Identification

Arxiv

0+阅读 · 2022年8月11日

相关基金

活性氧调控植物根生长的分子机理

国家自然科学基金

0+阅读 · 2015年12月31日

PPARγ外源激动剂早期干预NNK诱导肺癌干细胞中15-LOX表达调控的机制

国家自然科学基金

0+阅读 · 2014年12月31日

大规模分布式系统中服务失效的自动诊断方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于SURE/PURE准则的图像盲反卷积算法研究

国家自然科学基金

3+阅读 · 2013年12月31日

羊八井观测站大气不透明度的测量

国家自然科学基金

0+阅读 · 2013年12月31日

高光谱遥感图像解混的稀疏性正则化方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

TRAF1在心肌梗死后心室重构中的作用及分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

基于复值ICA和张量分解的完备fMRI数据分析方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于信道Time/Power度量指标的TOA测距误差模型及其应用研究

国家自然科学基金

0+阅读 · 2011年12月31日

多天线OFDM信道全信息压缩估计理论与方法

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员