The paper discusses derivative-free optimization (DFO), in which a function is minimized using only function evaluations, without access to gradients or directional derivatives. Classical DFO methods such as Nelder-Mead and direct search have limited scalability for high-dimensional problems. Zeroth-order methods, which mimic gradient-based methods by approximating derivatives from function values, have been gaining popularity due to the demands of large-scale machine learning applications, and the paper focuses on the selection of the step size $\alpha_k$ in these methods. The proposed approach, called Curvature-Aware Random Search (CARS), uses first- and second-order finite difference approximations to compute a candidate step size $\alpha_{+}$. We prove that for strongly convex objective functions, CARS converges linearly provided the search direction is drawn from a distribution satisfying very mild conditions. We also present a cubic-regularized variant of CARS, named CARS-CR, which converges at a rate of $\mathcal{O}(k^{-1})$ without the assumption of strong convexity. Numerical experiments show that CARS and CARS-CR match or exceed state-of-the-art methods on benchmark problem sets.
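For concreteness, below is a minimal Python sketch of one curvature-aware random-search step of the kind described above: a random direction is drawn, the directional first and second derivatives are estimated by central finite differences, and a Newton-like candidate step $\alpha_{+}$ is formed along that direction. The unit-Gaussian sampling distribution, the spacing `h`, and the simple acceptance test are illustrative assumptions, not the paper's exact algorithm (which includes safeguards and, in CARS-CR, a cubic-regularization term).

```python
import numpy as np

def cars_step(f, x, h=1e-4, rng=np.random.default_rng()):
    """One curvature-aware random-search step (illustrative sketch).

    Draws a random direction u, estimates the directional first and second
    derivatives of f at x by central finite differences, and forms the
    Newton-like candidate step alpha = -d1 / d2 along u. The candidate point
    is accepted only if it decreases f.
    """
    u = rng.standard_normal(x.shape)
    u /= np.linalg.norm(u)                      # unit-norm search direction

    f0 = f(x)
    f_plus = f(x + h * u)
    f_minus = f(x - h * u)

    d1 = (f_plus - f_minus) / (2 * h)           # directional derivative estimate
    d2 = (f_plus - 2 * f0 + f_minus) / h ** 2   # directional curvature estimate

    if d2 <= 0:                                 # no useful curvature along u; keep x
        return x, f0

    alpha = -d1 / d2                            # candidate step size alpha_+
    x_new = x + alpha * u
    f_new = f(x_new)
    return (x_new, f_new) if f_new < f0 else (x, f0)


# Usage: minimize a simple quadratic from a nonzero starting point
if __name__ == "__main__":
    f = lambda x: 0.5 * np.sum(x ** 2)
    x = np.ones(10)
    for _ in range(200):
        x, fx = cars_step(f, x)
    print(fx)
```

Each step uses a small, fixed number of function evaluations, which is what makes this style of method attractive when gradients are unavailable.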