Stochastic majorization-minimization (SMM) is a class of stochastic optimization algorithms that proceed by sampling new data points and minimizing a recursive average of surrogate functions of an objective function. Conventionally, the surrogates are required to be strongly convex, and no convergence rate analysis was available for the general nonconvex setting. In this paper, we propose an extension of SMM in which the surrogates are allowed to be only weakly convex or block multi-convex, and the averaged surrogates are approximately minimized with proximal regularization or block-minimized within diminishing radii, respectively. For the general nonconvex constrained setting with non-i.i.d. data samples, we show that the first-order optimality gap of the proposed algorithm decays at the rate $O((\log n)^{1+\epsilon}/n^{1/2})$ for the empirical loss and $O((\log n)^{1+\epsilon}/n^{1/4})$ for the expected loss, where $n$ denotes the number of data samples processed. Under an additional assumption, the latter convergence rate can be improved to $O((\log n)^{1+\epsilon}/n^{1/2})$. As a corollary, we obtain the first convergence rate bounds for various optimization methods in the general nonconvex and dependent data setting: double-averaging projected gradient descent and its generalizations, proximal point empirical risk minimization, and online matrix/tensor decomposition algorithms. We also provide experimental validation of our results.
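For concreteness, the display below is a minimal sketch of the proximally regularized SMM iteration described above (the weakly convex surrogate case); the weight sequence $w_n \in (0,1]$, the per-sample surrogate $g_n$, and the proximal parameter $\lambda > 0$ are illustrative notation rather than the paper's own symbols.

$$
\bar{g}_n(\theta) \;=\; (1 - w_n)\,\bar{g}_{n-1}(\theta) \;+\; w_n\, g_n(\theta),
\qquad
\theta_n \;\approx\; \operatorname*{arg\,min}_{\theta \in \Theta}\;
\Big( \bar{g}_n(\theta) + \tfrac{\lambda}{2}\,\|\theta - \theta_{n-1}\|^{2} \Big),
$$

where $g_n$ is a weakly convex surrogate of the loss at the $n$-th data sample, $\bar{g}_n$ is the recursively averaged surrogate, and $\Theta$ is the constraint set; in the block multi-convex case, the proximal step is replaced by block minimization within diminishing radii, as stated in the abstract.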