Studying the properties of stochastic noise for optimizing complex non-convex functions has been an active area of research in machine learning. Prior work has shown that the noise of stochastic gradient descent improves optimization by overcoming undesirable obstacles in the loss landscape. Moreover, injecting artificial Gaussian noise has become a popular idea for quickly escaping saddle points. Indeed, in the absence of reliable gradient information, the noise is used to explore the landscape, but it is unclear what type of noise is optimal in terms of exploration ability. In order to narrow this gap in our knowledge, we study a general type of continuous-time non-Markovian process, based on fractional Brownian motion, that allows the increments of the process to be correlated. This generalizes processes based on Brownian motion, such as the Ornstein-Uhlenbeck process. We demonstrate how to discretize such processes, which gives rise to a new algorithm, fPGD. This method generalizes the known algorithms PGD and Anti-PGD. We study the properties of fPGD both theoretically and empirically, demonstrating that it possesses exploration abilities that, in some cases, are favorable compared to PGD and Anti-PGD. These results open the field to novel ways of exploiting noise for training machine learning models.
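The abstract does not spell out the fPGD update rule, but the idea it describes, perturbed gradient descent whose injected noise sequence has the correlation structure of fractional Brownian motion increments, can be sketched as follows. The function names, the step size, and the way the correlated noise is generated (Cholesky factorization of the fBm-increment covariance) are illustrative assumptions, not the paper's exact method. With Hurst index H = 0.5 the increments are independent, recovering PGD-style isotropic noise; H < 0.5 yields anticorrelated increments in the spirit of Anti-PGD.

```python
import numpy as np

def fbm_increment_cov(n, hurst):
    """Covariance matrix of n unit-time increments of fractional
    Brownian motion with Hurst index `hurst`:
    gamma(k) = 0.5 * (|k+1|^{2H} - 2|k|^{2H} + |k-1|^{2H})."""
    k = np.abs(np.subtract.outer(np.arange(n), np.arange(n)))
    h2 = 2.0 * hurst
    return 0.5 * ((k + 1) ** h2 - 2 * k ** h2 + np.abs(k - 1) ** h2)

def fpgd(grad, x0, steps, lr=0.1, sigma=0.1, hurst=0.5, seed=None):
    """Hypothetical sketch of fPGD: gradient descent perturbed by
    Gaussian noise whose covariance across iterations matches the
    increments of fractional Brownian motion.

    hurst = 0.5 -> independent perturbations (PGD-like);
    hurst < 0.5 -> anticorrelated perturbations (Anti-PGD-like)."""
    rng = np.random.default_rng(seed)
    cov = fbm_increment_cov(steps, hurst)
    # Small jitter keeps the Cholesky factorization numerically stable.
    chol = np.linalg.cholesky(cov + 1e-12 * np.eye(steps))
    x = np.asarray(x0, dtype=float)
    # One correlated noise sequence per coordinate of x.
    noise = chol @ rng.standard_normal((steps, x.size))
    for t in range(steps):
        x = x - lr * grad(x) + sigma * noise[t]
    return x
```

As a quick sanity check on a toy quadratic f(x) = ||x||^2 (gradient 2x), anticorrelated noise (hurst = 0.3) still lets the iterate settle near the minimizer:

```python
grad = lambda x: 2.0 * x
x_final = fpgd(grad, np.array([3.0, -2.0]), steps=200,
               lr=0.1, sigma=0.01, hurst=0.3, seed=0)
```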