非线性双时间-时间- 时间- 时间- 时间- 压力近似: 趋同和时- 时- 性性能 (Nonlinear Two-Time-Scale Stochastic Approximation: Convergence and Finite-Time Performance) - 专知论文

会员服务 ·

0

近似 · Performer · 控制器 · CASE · Performance ·

2021 年 3 月 23 日

Nonlinear Two-Time-Scale Stochastic Approximation: Convergence and Finite-Time Performance

翻译：非线性双时间-时间- 时间- 时间- 时间- 压力近似: 趋同和时- 时- 性性能

Two-time-scale stochastic approximation, a generalized version of the popular stochastic approximation, has found broad applications in many areas including stochastic control, optimization, and machine learning. Despite its popularity, theoretical guarantees of this method, especially its finite-time performance, are mostly achieved for the linear case while the results for the nonlinear counterpart are very sparse. Motivated by the classic control theory for singularly perturbed systems, we study in this paper the asymptotic convergence and finite-time analysis of the nonlinear two-time-scale stochastic approximation. Under some fairly standard assumptions, we provide a formula that characterizes the rate of convergence of the main iterates to the desired solutions. In particular, we show that the method achieves a convergence in expectation at a rate $\mathcal{O}(1/k^{2/3})$, where $k$ is the number of iterations. The key idea in our analysis is to properly choose the two step sizes to characterize the coupling between the fast and slow-time-scale iterates.

翻译：两种时间尺度的随机近似值是流行的随机近似值的通用版本,它在许多领域都得到了广泛的应用,包括随机控制、优化和机器学习。尽管它很受欢迎,但这一方法的理论保障,特别是其有限时间性能,大部分是线性案例的理论保障,而非线性对应方的结果则非常稀少。我们受对奇特扰动系统的经典控制理论的驱动,在本文中研究非线性双级随机近近似值的无症状趋同和有限时间分析。根据一些相当标准的假设,我们提供了一种公式,说明主要试样与理想解决办法的趋同率。特别是,我们表明,该方法达到了预期的趋同率,以$\mathcal{O}(1/k ⁇ 2/3}美元,其中美元是迭代数。我们分析的关键思想是正确选择两步尺大小,以描述快速和慢时速级的近似值之间的合并。

0

相关内容

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

【ICLR2021】微分动态规划神经优化器

专知会员服务

16+阅读 · 2021年3月4日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

专知会员服务

54+阅读 · 2020年3月5日

【电子书】人工智能编程范式（Paradigms of Artificial Intelligence Programming）1048页PDF免费下载

【电子书】人工智能编程范式（Paradigms of Artificial Intelligence Programming）1048页PDF免费下载

专知会员服务

50+阅读 · 2019年10月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

深度卷积神经网络中的降采样

深度卷积神经网络中的降采样

极市平台

12+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

19篇ICML2019论文摘录选读！

19篇ICML2019论文摘录选读！

专知

28+阅读 · 2019年4月28日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

Call4Papers

5+阅读 · 2018年12月7日

条件GAN重大改进！cGANs with Projection Discriminator

条件GAN重大改进！cGANs with Projection Discriminator

CreateAMind

8+阅读 · 2018年2月7日

gan生成图像at 1024² 的代码论文

gan生成图像at 1024² 的代码论文

CreateAMind

4+阅读 · 2017年10月31日

Theoretical Foundations of t-SNE for Visualizing High-Dimensional Clustered Data

Arxiv

0+阅读 · 2021年5月18日

An SDE Framework for Adversarial Training, with Convergence and Robustness Analysis

Arxiv

0+阅读 · 2021年5月17日

Maximum Likelihood Estimation for Nets of Conics

Arxiv

0+阅读 · 2021年5月17日

Convergence guarantee for the sparse monotone single index model

Arxiv

0+阅读 · 2021年5月17日

Non-asymptotic bounds for stochastic optimization with biased noisy gradient oracles

Arxiv

0+阅读 · 2021年5月16日

Optimal control of robust team stochastic games

Arxiv

0+阅读 · 2021年5月16日

A Discrete-Time Switching System Analysis of Q-learning

Arxiv

0+阅读 · 2021年5月15日

Analysis of stochastic Lanczos quadrature for spectrum approximation

Arxiv

0+阅读 · 2021年5月13日

The Dynamics of Gradient Descent for Overparametrized Neural Networks

Arxiv

0+阅读 · 2021年5月13日

On the Explicit Role of Initialization on the Convergence and Implicit Bias of Overparametrized Linear Networks

On the Explicit Role of Initialization on the Convergence and Implicit Bias of Overparametrized Linear Networks

Arxiv

0+阅读 · 2021年5月13日

VIP会员

文章信息

相关主题

相关VIP内容

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

【ICLR2021】微分动态规划神经优化器

专知会员服务

16+阅读 · 2021年3月4日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

专知会员服务

54+阅读 · 2020年3月5日

【电子书】人工智能编程范式（Paradigms of Artificial Intelligence Programming）1048页PDF免费下载

【电子书】人工智能编程范式（Paradigms of Artificial Intelligence Programming）1048页PDF免费下载

专知会员服务

50+阅读 · 2019年10月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《复杂工程系统模型驱动设计决策支持系统：早期设计阶段挑战》最新138页

《日本陆上自卫队2040年作战方式与未来作战研究》最新23页slides

人工智能作为战争武器

《后勤保障》最新23页

相关资讯

深度卷积神经网络中的降采样

深度卷积神经网络中的降采样

极市平台

12+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

19篇ICML2019论文摘录选读！

19篇ICML2019论文摘录选读！

专知

28+阅读 · 2019年4月28日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

Call4Papers

5+阅读 · 2018年12月7日

条件GAN重大改进！cGANs with Projection Discriminator

条件GAN重大改进！cGANs with Projection Discriminator

CreateAMind

8+阅读 · 2018年2月7日

gan生成图像at 1024² 的代码论文

gan生成图像at 1024² 的代码论文

CreateAMind

4+阅读 · 2017年10月31日

相关论文

Theoretical Foundations of t-SNE for Visualizing High-Dimensional Clustered Data

Arxiv

0+阅读 · 2021年5月18日

An SDE Framework for Adversarial Training, with Convergence and Robustness Analysis

Arxiv

0+阅读 · 2021年5月17日

Maximum Likelihood Estimation for Nets of Conics

Arxiv

0+阅读 · 2021年5月17日

Convergence guarantee for the sparse monotone single index model

Arxiv

0+阅读 · 2021年5月17日

Non-asymptotic bounds for stochastic optimization with biased noisy gradient oracles

Arxiv

0+阅读 · 2021年5月16日

Optimal control of robust team stochastic games

Arxiv

0+阅读 · 2021年5月16日

A Discrete-Time Switching System Analysis of Q-learning

Arxiv

0+阅读 · 2021年5月15日

Analysis of stochastic Lanczos quadrature for spectrum approximation

Arxiv

0+阅读 · 2021年5月13日

The Dynamics of Gradient Descent for Overparametrized Neural Networks

Arxiv

0+阅读 · 2021年5月13日

On the Explicit Role of Initialization on the Convergence and Implicit Bias of Overparametrized Linear Networks

On the Explicit Role of Initialization on the Convergence and Implicit Bias of Overparametrized Linear Networks

Arxiv

0+阅读 · 2021年5月13日

微信扫码咨询专知VIP会员