Model-agnostic meta-learning (MAML) is currently one of the dominant approaches for few-shot meta-learning. Despite its effectiveness, the optimization of MAML can be challenging due to the innate bilevel problem structure. Specifically, the loss landscape of MAML is much more complex, with possibly more saddle points and local minimizers, than that of its empirical risk minimization counterpart. To address this challenge, we leverage the recently proposed sharpness-aware minimization and develop a sharpness-aware MAML approach that we term Sharp-MAML. We empirically demonstrate that Sharp-MAML and its computationally efficient variant outperform the plain-vanilla MAML baseline (e.g., $+3\%$ accuracy on Mini-Imagenet). We complement the empirical study with a convergence rate analysis and a generalization bound for Sharp-MAML. To the best of our knowledge, this is the first empirical and theoretical study on sharpness-aware minimization in the context of bilevel learning. The code is available at https://github.com/mominabbass/Sharp-MAML.
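To make the idea concrete, the following is a minimal sketch of wrapping sharpness-aware minimization (SAM) around the meta (upper-level) update of MAML: the meta-gradient is first used to perturb the meta-parameters toward a nearby high-loss point, and the descent step is then taken with the gradient evaluated at that perturbed point. The helper names `inner_adapt` and `meta_loss`, the choice of perturbing only the upper level, and the values of `rho` and `meta_lr` are illustrative assumptions for this sketch, not the released Sharp-MAML implementation.

```python
# A PyTorch-style sketch of a SAM-perturbed MAML meta-update (assumptions noted above).
import torch

def sam_meta_step(meta_params, tasks, inner_adapt, meta_loss,
                  rho=0.05, meta_lr=1e-3):
    """One meta-update over a batch of tasks with a SAM perturbation at the upper level.

    `meta_params`  : list of leaf tensors with requires_grad=True.
    `inner_adapt`  : user-supplied differentiable inner-loop adaptation (MAML lower level).
    `meta_loss`    : user-supplied query-set loss of the adapted parameters on a task.
    """
    # 1) Meta-loss and meta-gradient at the current meta-parameters.
    loss = sum(meta_loss(inner_adapt(meta_params, t), t) for t in tasks) / len(tasks)
    grads = torch.autograd.grad(loss, meta_params)

    # 2) Ascend to the SAM point w + rho * g / ||g|| (neighborhood of radius rho).
    grad_norm = torch.sqrt(sum((g ** 2).sum() for g in grads)) + 1e-12
    perturbed = [w + rho * g / grad_norm for w, g in zip(meta_params, grads)]

    # 3) Re-evaluate the meta-loss at the perturbed point and descend from the
    #    original meta-parameters using the gradient taken there.
    loss_p = sum(meta_loss(inner_adapt(perturbed, t), t) for t in tasks) / len(tasks)
    grads_p = torch.autograd.grad(loss_p, perturbed)
    with torch.no_grad():
        for w, g in zip(meta_params, grads_p):
            w -= meta_lr * g
    return loss.item()
```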