Recently, several studies have considered the stochastic optimization problem in a heavy-tailed noise regime, i.e., the difference between the stochastic gradient and the true gradient is assumed to have a finite $p$-th moment (say, upper bounded by $\sigma^{p}$ for some $\sigma\geq0$) where $p\in(1,2]$, which not only generalizes the traditional finite-variance assumption ($p=2$) but has also been observed in practice for several different tasks. Under this challenging assumption, much new progress has been made for both convex and nonconvex problems; however, most existing works only consider smooth objectives. In contrast, the problem has not been fully explored or well understood when the objective is nonsmooth. This paper aims to fill this crucial gap by providing a comprehensive analysis of stochastic nonsmooth convex optimization with heavy-tailed noises. We revisit a simple clipping-based algorithm, which so far has only been proved to converge in expectation and, moreover, only under the additional strong convexity assumption. Under appropriate choices of parameters, for both convex and strongly convex functions, we not only establish the first high-probability rates but also give refined in-expectation bounds compared with existing works. Remarkably, all of our results are optimal (or nearly optimal up to logarithmic factors) with respect to the time horizon $T$, even when $T$ is unknown in advance. Additionally, we show how to make the algorithm parameter-free with respect to $\sigma$; in other words, the algorithm can still guarantee convergence without any prior knowledge of $\sigma$.
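As an illustrative sketch of the setting (the symbols $g_t$, $\tau_t$, and $\eta_t$ are introduced here for exposition and the paper's exact clipping schedule may differ): the heavy-tailed assumption states that the stochastic gradient $g_t$ at the iterate $x_t$ satisfies $\mathbb{E}\big[\|g_t-\nabla f(x_t)\|^{p}\big]\leq\sigma^{p}$ for some $p\in(1,2]$, and a generic clipping-based update takes the form $\hat{g}_t=\min\{1,\tau_t/\|g_t\|\}\,g_t$ followed by $x_{t+1}=x_t-\eta_t\hat{g}_t$, where $\tau_t>0$ is a clipping threshold and $\eta_t>0$ a step size.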