We introduce SketchySGD, a stochastic quasi-Newton method that uses sketching to approximate the curvature of the loss function. Quasi-Newton methods are among the most effective algorithms in traditional optimization, where they converge much faster than first-order methods such as SGD. However, for contemporary deep learning, quasi-Newton methods are considered inferior to first-order methods like SGD and Adam owing to their higher per-iteration complexity and fragility in the presence of inexact gradients. SketchySGD circumvents these issues through a novel combination of subsampling, randomized low-rank approximation, and dynamic regularization. In the convex case, we show that SketchySGD with a fixed stepsize converges to a small ball around the optimum at a faster rate than SGD for ill-conditioned problems. In the non-convex case, SketchySGD converges linearly under two additional assumptions, interpolation and the Polyak-Łojasiewicz condition, the latter of which holds with high probability for wide neural networks. Numerical experiments on image and tabular data demonstrate the improved reliability and speed of SketchySGD for deep learning, compared to standard optimizers such as SGD and Adam and to existing quasi-Newton methods.
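The abstract describes the method only at a high level. The following is a minimal, illustrative Python sketch of one sketch-and-precondition SGD step in the spirit of the ingredients listed above (subsampled Hessian-vector products, a randomized Nyström low-rank approximation, and a regularized preconditioner). It is written for a toy least-squares problem; all names (`nystrom_approx`, `preconditioned_step`), the choice of `rho`, and the ranks and batch sizes are hypothetical illustrations, not the paper's exact algorithm or hyperparameters.

```python
# Illustrative sketch only: subsampled HVPs + randomized Nystrom low-rank
# approximation + regularized preconditioning on a toy least-squares problem.
import numpy as np

rng = np.random.default_rng(0)

# Synthetic, ill-conditioned least-squares problem: min_w 0.5/n * ||A w - b||^2.
n, d, rank = 2000, 100, 10
A = rng.standard_normal((n, d)) * np.logspace(0, -3, d)
w_true = rng.standard_normal(d)
b = A @ w_true + 0.01 * rng.standard_normal(n)

def grad(w, idx):
    """Stochastic gradient on the minibatch of rows `idx`."""
    Ai = A[idx]
    return Ai.T @ (Ai @ w - b[idx]) / len(idx)

def hvp(V, idx):
    """Minibatch Hessian-vector products H_S @ V, with H_S = A_S^T A_S / |S|."""
    Ai = A[idx]
    return Ai.T @ (Ai @ V) / len(idx)

def nystrom_approx(idx, rank):
    """Randomized Nystrom approximation of the subsampled Hessian.

    Returns U (d x rank, orthonormal) and eigenvalue estimates `lam` so that
    H_S is approximated by U @ diag(lam) @ U.T.  Only HVPs are required.
    """
    Omega, _ = np.linalg.qr(rng.standard_normal((d, rank)))  # test matrix
    Y = hvp(Omega, idx)                                      # the sketch
    nu = 1e-8 * np.linalg.norm(Y)                            # stability shift
    Y_nu = Y + nu * Omega
    M = Omega.T @ Y_nu
    C = np.linalg.cholesky(0.5 * (M + M.T))
    B = np.linalg.solve(C, Y_nu.T).T                         # B = Y_nu C^{-T}
    U, s, _ = np.linalg.svd(B, full_matrices=False)
    lam = np.maximum(s**2 - nu, 0.0)
    return U, lam

def preconditioned_step(w, lr, rho, batch, hess_batch, rank):
    """One SGD step preconditioned by (U diag(lam) U^T + rho * I)^{-1}."""
    g = grad(w, batch)
    U, lam = nystrom_approx(hess_batch, rank)
    g_par = U.T @ g
    # Low-rank part gets (lam + rho)^{-1}; the orthogonal complement is
    # scaled by 1 / (smallest retained eigenvalue + rho).
    v = U @ (g_par / (lam + rho)) + (g - U @ g_par) / (lam[-1] + rho)
    return w - lr * v

w = np.zeros(d)
rho = 1e-3  # regularizer; in practice it would be tied to curvature estimates
for t in range(200):
    batch = rng.choice(n, size=64, replace=False)        # gradient minibatch
    hess_batch = rng.choice(n, size=256, replace=False)  # Hessian minibatch
    w = preconditioned_step(w, lr=0.5, rho=rho, batch=batch,
                            hess_batch=hess_batch, rank=rank)

print("final loss:", 0.5 * np.mean((A @ w - b) ** 2))
```

Because the curvature model is a rank-`rank` sketch plus a regularizer, each step costs only a handful of Hessian-vector products and O(d * rank) extra arithmetic, which is what keeps the per-iteration overhead close to first-order methods in this kind of scheme.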