未调整的Langevin算法:为神经网络创建稳定有效的适应算法 (Polygonal Unadjusted Langevin Algorithms: Creating stable and efficient adaptive algorithms for neural networks)

We present a new class of Langevin based algorithms, which overcomes many of the known shortcomings of popular adaptive optimizers that are currently used for the fine tuning of deep learning models. Its underpinning theory relies on recent advances of Euler's polygonal approximations for stochastic differential equations (SDEs) with monotone coefficients. As a result, it inherits the stability properties of tamed algorithms, while it addresses other known issues, e.g. vanishing gradients in neural networks. In particular, we provide a nonasymptotic analysis and full theoretical guarantees for the convergence properties of an algorithm of this novel class, which we named TH$\varepsilon$O POULA (or, simply, TheoPouLa). Finally, several experiments are presented with different types of deep learning models, which show the superior performance of TheoPouLa over many popular adaptive optimization algorithms.

翻译：我们提出了一个基于兰格文的新型算法,它克服了目前用于深层学习模型微调的流行适应性优化器的许多已知缺点。其基础理论依赖于最近Euler的多边形近似值的进步,即单调系数的微分方程(SDEs ) 。结果,它继承了调制算法的稳定性,同时它解决了其他已知问题,例如神经网络中的渐变消失。特别是,我们为这个小类(我们称之为TH$\varepsilon$O POOUULA (或者简单地说,TheoPouLA)的算法的趋同性提供了非补救性分析和充分的理论保障。最后,它用不同种类的深层次学习模型进行了一些实验,这些模型显示了TheoPouLa相对于许多流行的适应性优化算法的优异性表现。

相关内容

Neural Networks

关注 1649

神经网络（Neural Networks）是世界上三个最古老的神经建模学会的档案期刊:国际神经网络学会(INNS)、欧洲神经网络学会(ENNS)和日本神经网络学会(JNNS)。神经网络提供了一个论坛，以发展和培育一个国际社会的学者和实践者感兴趣的所有方面的神经网络和相关方法的计算智能。神经网络欢迎高质量论文的提交，有助于全面的神经网络研究，从行为和大脑建模，学习算法，通过数学和计算分析，系统的工程和技术应用，大量使用神经网络的概念和技术。这一独特而广泛的范围促进了生物和技术研究之间的思想交流，并有助于促进对生物启发的计算智能感兴趣的跨学科社区的发展。因此，神经网络编委会代表的专家领域包括心理学，神经生物学，计算机科学，工程，数学，物理。该杂志发表文章、信件和评论以及给编辑的信件、社论、时事、软件调查和专利信息。文章发表在五个部分之一:认知科学，神经科学，学习系统，数学和计算分析、工程和应用。官网地址：http://dblp.uni-trier.de/db/journals/nn/

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

【ICML2021】异质风险最小化，Heterogeneous Risk Minimization

专知会员服务

16+阅读 · 2021年5月21日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

最新《自监督表示学习》报告，70页ppt

专知会员服务

86+阅读 · 2020年12月22日