连续观测关于内壳加速的存储式渐变源和随机流星流的连续视图 (A Continuized View on Nesterov Acceleration for Stochastic Gradient Descent and Randomized Gossip)

We introduce the continuized Nesterov acceleration, a close variant of Nesterov acceleration whose variables are indexed by a continuous time parameter. The two variables continuously mix following a linear ordinary differential equation and take gradient steps at random times. This continuized variant benefits from the best of the continuous and the discrete frameworks: as a continuous process, one can use differential calculus to analyze convergence and obtain analytical expressions for the parameters; and a discretization of the continuized process can be computed exactly with convergence rates similar to those of Nesterov original acceleration. We show that the discretization has the same structure as Nesterov acceleration, but with random parameters. We provide continuized Nesterov acceleration under deterministic as well as stochastic gradients, with either additive or multiplicative noise. Finally, using our continuized framework and expressing the gossip averaging problem as the stochastic minimization of a certain energy function, we provide the first rigorous acceleration of asynchronous gossip algorithms.

翻译：我们引入了内斯特罗夫加速度的紧凑变体, 即内斯特罗夫加速度的紧凑变体, 其变量由连续的时间参数索引。两个变体按照直线普通差分方程连续混合, 并在随机时间采取梯度步骤。这个相联变体从最好的连续和离散框架中获益: 作为一种连续过程, 可以使用不同的微积分来分析趋同, 并获得参数的分析表达方式; 与内斯特罗夫原加速度相似的趋同率可以精确地计算同的连结进程。我们显示离散化的结构与内斯特罗夫加速率相同, 但有随机参数。我们在确定性以及随机梯度梯度下提供内斯特罗夫加速度的相联配, 并且有添加或多倍复制性噪声。最后, 利用我们的contincult 框架, 表达八卦中的问题, 作为某种能量功能的随机最小化最小化最小化, 我们提供了非同步八象算法的首度加速度。

相关内容

Continuity

关注 4

让 iOS 8 和 OS X Yosemite 无缝切换的一个新特性。 > Apple products have always been designed to work together beautifully. But now they may really surprise you. With iOS 8 and OS X Yosemite, you’ll be able to do more wonderful things than ever before.

Source: Apple - iOS 8

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

近期必读的 NeurIPS2020 80多篇【图机器学习】相关论文

专知会员服务

54+阅读 · 2020年11月3日

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

专知会员服务

74+阅读 · 2020年8月2日