Bilevel Optimization with a Lower-level Contraction: Optimal Sample Complexity without Warm-Start

Optimizer · Sample complexity · Sample · Smoothness · Meta-learning

30 May 2023

Riccardo Grazzi, Massimiliano Pontil, Saverio Salzo

From arXiv; published in JMLR. Code at https://github.com/CSML-IIT-UCL/bioptexps

We analyse a general class of bilevel problems, in which the upper-level problem consists in the minimization of a smooth objective function and the lower-level problem is to find the fixed point of a smooth contraction map. This type of problem includes instances of meta-learning, equilibrium models, hyperparameter optimization and data poisoning adversarial attacks. Several recent works have proposed algorithms which warm-start the lower-level problem, i.e., they use the previous lower-level approximate solution as a starting point for the lower-level solver. This warm-start procedure allows one to improve the sample complexity in both the stochastic and deterministic settings, achieving in some cases the order-wise optimal sample complexity. However, there are situations, e.g., meta-learning and equilibrium models, in which the warm-start procedure is not well-suited or is ineffective. In this work we show that, without warm-start, it is still possible to achieve order-wise (near) optimal sample complexity. In particular, we propose a simple method, which uses (stochastic) fixed point iterations at the lower level and projected inexact gradient descent at the upper level, that reaches an $\epsilon$-stationary point using $O(\epsilon^{-2})$ and $\tilde{O}(\epsilon^{-1})$ samples for the stochastic and the deterministic setting, respectively. Finally, compared to methods using warm-start, our approach yields a simpler analysis that does not need to study the coupled interactions between the upper-level and lower-level iterates.
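To make the structure of such a method concrete, here is a minimal deterministic sketch on a toy problem. It is not the authors' implementation (see the linked repository for that). Everything specific in it is an assumption made for illustration: the linear contraction $\Phi(w, \lambda) = Aw + B\lambda$, the quadratic upper-level objective $E(w, \lambda) = \tfrac{1}{2}\|w - t\|^2$, the box constraint on $\lambda$, the function names, and the choice to estimate the hypergradient by implicit differentiation, solving the linear system $v = A^\top v + \nabla_w E$ with fixed point iterations. The one feature taken from the abstract is that the lower-level solver restarts from the same point at every upper-level step, i.e., there is no warm-start.

```python
import numpy as np


def fixed_point(step, z0, n_iters):
    """Apply z <- step(z) for n_iters iterations, starting from z0."""
    z = z0
    for _ in range(n_iters):
        z = step(z)
    return z


def approx_hypergradient(lam, A, B, target, n_lower, n_linear):
    """Inexact gradient of f(lam) = E(w(lam), lam), with w(lam) = A w(lam) + B lam."""
    # Lower level: always restart from zeros (no warm-start).
    w = fixed_point(lambda w: A @ w + B @ lam, np.zeros(A.shape[0]), n_lower)
    # Gradient of the assumed objective E(w, lam) = 0.5 * ||w - target||^2 w.r.t. w.
    grad_w = w - target
    # Implicit differentiation (assumed here): solve v = A^T v + grad_w,
    # again by fixed point iterations started from zero.
    v = fixed_point(lambda v: A.T @ v + grad_w, np.zeros_like(grad_w), n_linear)
    # E does not depend on lam directly in this toy problem,
    # so the hypergradient reduces to B^T v.
    return B.T @ v


def project_box(lam, lo=-1.0, hi=1.0):
    """Euclidean projection onto the box [lo, hi]^d (the assumed feasible set)."""
    return np.clip(lam, lo, hi)


def bilevel_solve(A, B, target, lam0, step_size, n_upper, n_lower, n_linear):
    """Projected inexact gradient descent on the upper-level variable lam."""
    lam = lam0
    for _ in range(n_upper):
        g = approx_hypergradient(lam, A, B, target, n_lower, n_linear)
        lam = project_box(lam - step_size * g)
    return lam


# Toy instance: A = 0.5 * I is a contraction, so w(lam) = 2 * lam
# and f(lam) = 0.5 * ||2 * lam - target||^2, minimized at lam = target / 2.
A = 0.5 * np.eye(2)
B = np.eye(2)
target = np.array([1.0, -0.6])
lam = bilevel_solve(A, B, target, np.zeros(2),
                    step_size=0.2, n_upper=200, n_lower=50, n_linear=50)
```

On this toy instance the minimizer $t/2$ lies inside the box, so the projection is inactive at the solution. In the stochastic setting the two inner loops would use sampled estimates of $\Phi$ and its derivatives; that variant is not shown here.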
