Many fundamental problems in machine learning can be formulated as the convex program \[ \min_{\theta\in \mathbb{R}^d}\ \sum_{i=1}^{n}f_{i}(\theta), \] where each $f_i$ is a convex, Lipschitz function supported on a subset of $d_i$ coordinates of $\theta$. One common approach to this problem, exemplified by stochastic gradient descent, involves sampling one $f_i$ term at every iteration to make progress. This approach crucially relies on a notion of uniformity across the $f_i$'s, formally captured by their condition number. In this work, we give an algorithm that minimizes the above convex formulation to $\epsilon$-accuracy in $\widetilde{O}(\sum_{i=1}^n d_i \log (1/\epsilon))$ gradient computations, with no assumptions on the condition number. The previous best algorithm independent of the condition number is the standard cutting-plane method, which requires $O(nd \log (1/\epsilon))$ gradient computations. As a corollary, we improve upon the evaluation oracle complexity for decomposable submodular minimization by Axiotis et al. (ICML 2021). Our main technical contribution is an adaptive procedure to select an $f_i$ term at every iteration via a novel combination of cutting-plane and interior-point methods.
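To make the setting concrete, the following is a minimal sketch (not the paper's algorithm) of the decomposable objective and the stochastic-subgradient baseline mentioned above: each $f_i$ touches only the coordinates in `f_supports[i]`, and one term is sampled per iteration. All names (`stochastic_subgradient`, `f_supports`, `subgrads`) and the toy example are illustrative assumptions, and the step-size schedule is a generic choice whose guarantees are exactly the kind that depend on uniformity (condition number) across the $f_i$'s.

```python
import numpy as np

def stochastic_subgradient(f_supports, subgrads, d, steps=10_000, lr=1e-2, seed=0):
    """Baseline sketch for minimizing sum_i f_i(theta).

    f_supports[i]: coordinate indices that f_i depends on (length d_i).
    subgrads[i]:   callable mapping theta[support] -> a subgradient of f_i
                   with respect to those coordinates only.
    """
    rng = np.random.default_rng(seed)
    n = len(f_supports)
    theta = np.zeros(d)
    for t in range(steps):
        i = rng.integers(n)                        # sample one f_i per iteration
        idx = f_supports[i]
        g = subgrads[i](theta[idx])                # subgradient on the d_i active coordinates
        # Scale by n so the sampled term is an unbiased estimate of the full sum's subgradient.
        theta[idx] -= (lr / np.sqrt(t + 1)) * n * g
    return theta

# Toy instance: f_i(theta) = |theta[i] - 1| + |theta[(i+1) % d]|, each on two coordinates.
d = 5
supports = [np.array([i, (i + 1) % d]) for i in range(d)]
subgrads = [lambda x: np.array([np.sign(x[0] - 1.0), np.sign(x[1])]) for _ in range(d)]
print(np.round(stochastic_subgradient(supports, subgrads, d), 2))
```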