Time-dependent data-generating distributions have proven difficult for gradient-based training of neural networks, as the greedy updates result in catastrophic forgetting of previously learned knowledge. Despite progress in the field of continual learning to overcome this forgetting, we show that a set of common state-of-the-art methods still suffers from substantial forgetting upon starting to learn new tasks, except that this forgetting is temporary and followed by a phase of performance recovery. We refer to this intriguing but potentially problematic phenomenon as the stability gap. The stability gap likely remained under the radar due to the standard practice in the field of evaluating continual learning models only after each task. Instead, we establish a framework for continual evaluation that uses per-iteration evaluation, and we define a new set of metrics to quantify worst-case performance. Empirically, we show that experience replay, constraint-based replay, knowledge distillation, and parameter regularization methods are all prone to the stability gap, and that the stability gap can be observed in class-, task-, and domain-incremental learning benchmarks. Additionally, a controlled experiment shows that the stability gap increases when tasks are more dissimilar. Finally, by disentangling gradients into plasticity and stability components, we propose a conceptual explanation for the stability gap.
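The per-iteration evaluation protocol described above can be sketched as follows. This is a minimal illustrative sketch, not the paper's implementation: all names (`continual_evaluation`, `train_step`, `evaluate`, the scripted accuracy schedule) are assumptions, and one worst-case metric (the minimum accuracy observed per task over all iterations) stands in for the paper's full metric set.

```python
# Hypothetical sketch: evaluate on every task seen so far after EACH
# training iteration, rather than only at task boundaries, and track a
# worst-case metric (per-task minimum accuracy). Names are illustrative.

def continual_evaluation(tasks, model, train_step, evaluate):
    """Run per-iteration evaluation over a task sequence."""
    min_acc = {}   # task id -> lowest accuracy seen at any iteration
    history = []   # per-iteration accuracy snapshots
    for t, task_data in enumerate(tasks):
        for batch in task_data:
            train_step(model, batch)
            # Evaluate on all tasks seen so far, every iteration.
            accs = {k: evaluate(model, k) for k in range(t + 1)}
            for k, a in accs.items():
                min_acc[k] = min(min_acc.get(k, a), a)
            history.append(accs)
    return min_acc, history

# Scripted accuracies for illustration: task 0 dips when task 1 starts
# (iteration 3) and then partially recovers -- the stability-gap pattern.
_sched = {0: iter([0.60, 0.90, 0.40, 0.80]), 1: iter([0.50, 0.70])}
min_acc, history = continual_evaluation(
    tasks=[[None, None], [None, None]],         # two tasks, two batches each
    model=None,
    train_step=lambda model, batch: None,       # dummy update
    evaluate=lambda model, k: next(_sched[k]),  # scripted accuracies
)
```

Task-boundary evaluation would only see task 0 at accuracies 0.90 and 0.80 and miss the transient drop to 0.40, which is exactly what the worst-case metric `min_acc` captures.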
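The gradient disentanglement mentioned at the end can be illustrated with a toy model. This is a minimal pure-Python sketch under simplifying assumptions (a scalar logistic-regression weight, made-up data, equal-sized new-task and replay batches), not the paper's analysis: here the "plasticity" component is the gradient of the loss on the new task and the "stability" component is the gradient on replayed old-task examples.

```python
import math

# Hypothetical sketch: disentangle the update of a replay-trained
# logistic-regression model into a plasticity gradient (new-task loss)
# and a stability gradient (loss on replayed old-task data).

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def bce_grad(w, xs, ys):
    """Gradient of mean binary cross-entropy w.r.t. scalar weight w."""
    return sum((sigmoid(w * x) - y) * x for x, y in zip(xs, ys)) / len(xs)

w = 0.3
new_x, new_y = [1.0, -0.5, 2.0, 0.4], [1, 0, 1, 1]   # current task
old_x, old_y = [0.8, -1.2, 0.1, -0.3], [0, 1, 0, 1]  # replay buffer

g_plasticity = bce_grad(w, new_x, new_y)  # drives learning the new task
g_stability = bce_grad(w, old_x, old_y)   # protects old-task performance
g_total = bce_grad(w, new_x + old_x, new_y + old_y)

# With equal-sized batches, the joint-batch gradient is the average of
# the two components; in this toy data they point in opposite directions,
# so a step that helps the new task hurts the old one.
assert abs(g_total - 0.5 * (g_plasticity + g_stability)) < 1e-12
```

When the plasticity component dominates early in a new task, the combined update initially degrades old-task performance, consistent with the transient drop the abstract describes.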