Parameter regularization and parameter allocation methods are effective at overcoming catastrophic forgetting in lifelong learning. However, they treat all tasks in a sequence uniformly and ignore differences in learning difficulty across tasks. As a result, parameter regularization methods suffer significant forgetting when a new task differs greatly from previously learned tasks, and parameter allocation methods incur unnecessary parameter overhead on simple tasks. In this paper, we propose Parameter Allocation & Regularization (PAR), which adaptively selects an appropriate strategy for each task, parameter allocation or regularization, according to its learning difficulty. A task is easy for a model that has already learned related tasks, and hard otherwise. We propose a divergence estimation method based on the Nearest-Prototype distance to measure task relatedness using only the features of the new task. Moreover, we propose a time-efficient, relatedness-aware, sampling-based architecture search strategy to reduce the parameter overhead of allocation. Experimental results on multiple benchmarks demonstrate that, compared with state-of-the-art methods, our approach is scalable and significantly reduces model redundancy while improving performance. Further qualitative analysis shows that PAR obtains reasonable task relatedness.
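To illustrate the idea of estimating divergence from Nearest-Prototype distances, below is a minimal, hedged sketch in Python. It assumes prototypes are simply class-mean feature vectors and adapts the classic k-NN style KL-divergence estimator by replacing nearest-neighbor distances with nearest-prototype distances; the helper names (`class_prototypes`, `np_divergence`) and the exact constants are illustrative and may differ from the paper's formulation.

```python
import numpy as np

def class_prototypes(features, labels):
    """Compute one prototype (mean feature vector) per class (assumption:
    prototypes are class means of extracted features)."""
    return np.stack([features[labels == c].mean(axis=0)
                     for c in np.unique(labels)])

def np_divergence(new_feats, new_protos, old_protos):
    """Illustrative nearest-prototype estimate of KL(new || old).

    Adapted from the k-NN divergence estimator: nearest-neighbor distances
    are replaced by distances to the nearest prototype of the new task
    (rho) and of a previously learned task (nu). Only new-task features
    are required, matching the abstract's claim."""
    d = new_feats.shape[1]
    # distance from each new-task feature to its nearest new-task prototype
    rho = np.linalg.norm(new_feats[:, None] - new_protos[None], axis=2).min(axis=1)
    # distance from each new-task feature to its nearest old-task prototype
    nu = np.linalg.norm(new_feats[:, None] - old_protos[None], axis=2).min(axis=1)
    eps = 1e-12  # numerical guard against log(0)
    n, m = len(new_protos), len(old_protos)
    return d * np.mean(np.log((nu + eps) / (rho + eps))) + np.log(m / n)
```

Under this sketch, a small estimated divergence would indicate that the new task is closely related to an already learned task, so a regularization-based strategy can reuse existing parameters; a large divergence would instead trigger parameter allocation.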