Robust MAML: Prioritization task buffer with adaptive learning process for model-agnostic meta-learning

Model-agnostic meta-learning (MAML) is a popular state-of-the-art meta-learning algorithm that provides a good weight initialization for a model given a variety of learning tasks. A model initialized with these weights can be fine-tuned to an unseen task using only a small number of samples and a few adaptation steps. MAML is simple and versatile, but it requires costly learning-rate tuning and careful design of the task distribution, which affects its scalability and generalization. This paper proposes a more robust MAML, referred to as Robust MAML (RMAML), based on an adaptive learning scheme and a prioritization task buffer (PTB), to improve the scalability of the training process and alleviate the problem of distribution mismatch. RMAML uses gradient-based hyper-parameter optimization to automatically find the optimal learning rate, and uses the PTB to gradually adjust the training task distribution toward the testing task distribution over the course of training. Experimental results on meta reinforcement learning environments demonstrate a substantial performance gain; RMAML is also less sensitive to hyper-parameter choice and more robust to distribution mismatch.
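To make the idea of learning the adaptation step size by gradient descent concrete, here is a minimal toy sketch. It is not the RMAML implementation: it assumes each task is a one-dimensional quadratic loss L_a(w) = (w - a)^2, uses a single inner adaptation step, and derives the meta-gradients analytically. All function names, task values, and constants are hypothetical, and the prioritization task buffer is not modeled.

```python
# Toy illustration only: each "task" is a quadratic loss L_a(w) = (w - a)^2.
# All names and constants are hypothetical; this is not the RMAML code.

def inner_adapt(w, alpha, a):
    """One inner-loop gradient step on task a, starting from the shared init w."""
    grad = 2.0 * (w - a)
    return w - alpha * grad


def post_adaptation_loss(w, alpha, tasks):
    """Mean task loss after one adaptation step (the meta-objective)."""
    return sum((inner_adapt(w, alpha, a) - a) ** 2 for a in tasks) / len(tasks)


def meta_gradients(w, alpha, tasks):
    """Analytic gradients of the meta-objective with respect to w and alpha.

    After adaptation, w' - a = (1 - 2*alpha) * (w - a), so the meta-objective
    is the mean over tasks of (1 - 2*alpha)^2 * (w - a)^2.
    """
    shrink = 1.0 - 2.0 * alpha
    n = len(tasks)
    grad_w = sum(2.0 * shrink ** 2 * (w - a) for a in tasks) / n
    grad_alpha = sum(-4.0 * shrink * (w - a) ** 2 for a in tasks) / n
    return grad_w, grad_alpha


def train(tasks, w=0.0, alpha=0.1, meta_lr=0.1, alpha_lr=0.01, steps=200):
    """Jointly update the initialization w and the inner learning rate alpha."""
    for _ in range(steps):
        grad_w, grad_alpha = meta_gradients(w, alpha, tasks)
        w -= meta_lr * grad_w            # usual MAML-style outer update
        alpha -= alpha_lr * grad_alpha   # gradient-based learning-rate update
    return w, alpha


if __name__ == "__main__":
    tasks = [-1.0, 0.0, 2.0]
    w, alpha = train(tasks)
    print("learned alpha:", alpha)
    print("post-adaptation loss:", post_adaptation_loss(w, alpha, tasks))
```

In this toy setting a step size of 0.5 solves every quadratic task in one step, so the learned alpha drifts toward that value without manual tuning. Real meta reinforcement learning objectives have no such closed form and would rely on automatic differentiation instead.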



MAML (Model-Agnostic Meta-Learning) is one of the classic meta-learning algorithms, introduced in the paper "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks". Original paper: https://arxiv.org/abs/1703.03400
