Computing the Jacobian of the solution of an optimization problem is a central problem in machine learning, with applications in hyperparameter optimization, meta-learning, optimization as a layer, and dataset distillation, to name a few. Unrolled differentiation is a popular heuristic that approximates the solution using an iterative solver and differentiates it along the computational path. This work provides a non-asymptotic convergence-rate analysis of this approach on quadratic objectives for gradient descent and the Chebyshev method. We show that to ensure convergence of the Jacobian, we can either 1) choose a large learning rate, leading to fast asymptotic convergence but accepting that the algorithm may have an arbitrarily long burn-in phase, or 2) choose a smaller learning rate, leading to immediate but slower convergence. We refer to this phenomenon as the curse of unrolling. Finally, we discuss open problems related to this approach, such as deriving a practical update rule for the optimal unrolling strategy, and we make novel connections with the field of Sobolev orthogonal polynomials.
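To make the setup concrete, here is a minimal sketch of unrolled differentiation on a quadratic objective, written in JAX. The matrix `A`, vector `b`, step size, and iteration count are illustrative choices (not values from the paper): gradient descent is run for a fixed number of steps on f(x) = ½ xᵀAx − bᵀx, and the Jacobian of the final iterate with respect to b is obtained by differentiating through the unrolled iterations, then compared against the exact Jacobian A⁻¹ of the solution x*(b) = A⁻¹b.

```python
# A minimal sketch of unrolled differentiation (illustrative, not the paper's code).
import jax
import jax.numpy as jnp

def unrolled_gd(b, A, x0, step_size, num_steps):
    """Run `num_steps` of gradient descent on f(x) = 0.5 x^T A x - b^T x
    and return the final iterate. Differentiating this function w.r.t. b
    backpropagates through the entire computational path (unrolling)."""
    x = x0
    for _ in range(num_steps):
        grad_f = A @ x - b          # gradient of the quadratic objective
        x = x - step_size * grad_f
    return x

key = jax.random.PRNGKey(0)
M = jax.random.normal(key, (5, 5))
A = M @ M.T + jnp.eye(5)            # symmetric positive definite (illustrative)
b = jnp.ones(5)
x0 = jnp.zeros(5)
step_size = 1.0 / jnp.linalg.norm(A, ord=2)   # 1/L, inside the stable range

# Jacobian of the unrolled iterate vs. the exact Jacobian A^{-1} of x*(b) = A^{-1} b.
jac_unrolled = jax.jacobian(unrolled_gd)(b, A, x0, step_size, 200)
jac_exact = jnp.linalg.inv(A)
print(jnp.max(jnp.abs(jac_unrolled - jac_exact)))
```

The trade-off described in the abstract shows up here through `step_size`: a larger (but still stable) step size makes `jac_unrolled` converge to `jac_exact` faster asymptotically, at the cost of a potentially long initial phase during which the Jacobian error can grow before it decreases.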