We focus on the problem of learning without forgetting from multiple tasks arriving sequentially, where each task is defined by a few-shot episode of novel or already seen classes. We approach this problem using the recently published HyperTransformer (HT), a Transformer-based hypernetwork that generates specialized task-specific CNN weights directly from the support set. In order to learn from a continual sequence of tasks, we propose to recursively re-use the generated weights as input to the HT for the next task. In this way, the generated CNN weights themselves act as a representation of previously learned tasks, and the HT is trained to update these weights so that a new task can be learned without forgetting past tasks. This approach differs from most continual learning algorithms, which typically rely on replay buffers, weight regularization, or task-dependent architectural changes. We demonstrate that our proposed Continual HyperTransformer method, equipped with a prototypical loss, is capable of learning and retaining knowledge about past tasks across a variety of scenarios, including learning from mini-batches, task-incremental learning, and class-incremental learning.
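The sketch below is a minimal conceptual illustration (not the authors' implementation) of the recursive weight-generation loop described above: the CNN weights generated for task t are fed back into the HT together with the support set of task t+1. The `ToyHyperTransformer` stub, the tensor shapes, and the pooling of the support set are hypothetical simplifications so the example runs end to end; the real HT is a Transformer that attends over support-set embeddings and the previously generated CNN weights.

```python
# Conceptual sketch of Continual HyperTransformer-style recursive weight generation.
# All module names, dimensions, and data here are illustrative placeholders.
import torch
import torch.nn as nn


class ToyHyperTransformer(nn.Module):
    """Stand-in for the HyperTransformer: maps (previously generated CNN
    weights, support set of the current task) -> updated CNN weights.
    Implemented as a small MLP over pooled inputs purely for illustration."""

    def __init__(self, weight_dim: int, feature_dim: int):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(weight_dim + feature_dim, 256),
            nn.ReLU(),
            nn.Linear(256, weight_dim),
        )

    def forward(self, prev_weights: torch.Tensor, support: torch.Tensor) -> torch.Tensor:
        # Pool the few-shot support set into a single task embedding
        # (a hypothetical simplification of attending over support samples).
        task_embedding = support.mean(dim=0)
        return self.net(torch.cat([prev_weights, task_embedding], dim=-1))


weight_dim, feature_dim, shots = 128, 64, 5
ht = ToyHyperTransformer(weight_dim, feature_dim)

# Continual sequence of tasks: weights generated for task t become the
# input weights for task t + 1, so they accumulate knowledge of past tasks.
theta = torch.zeros(weight_dim)  # initial (empty) CNN weights
for t in range(4):  # four sequential few-shot tasks, for illustration
    support_set = torch.randn(shots, feature_dim)  # placeholder episode
    theta = ht(theta, support_set)  # theta now represents tasks 0..t
    print(f"task {t}: generated weight norm = {theta.norm():.3f}")
```

In training, the generated weights for each step would parameterize a CNN evaluated on query sets from the current and all previous tasks (e.g., with the prototypical loss mentioned above), so that the HT learns to update weights without erasing earlier tasks.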