利用超转者不断进行少热的学习 (Continual Few-Shot Learning Using HyperTransformers)

We focus on the problem of learning without forgetting from multiple tasks arriving sequentially, where each task is defined using a few-shot episode of novel or already seen classes. We approach this problem using the recently published HyperTransformer (HT), a Transformer-based hypernetwork that generates a specialized task-specific CNN weights directly from the support set. In order to learn from a continual sequence of task, we propose to recursively re-use the generated weights as input to the HT for the next task. This way, the generated CNN weights themselves act as a representation of previously learned tasks, and the HT is trained to update these weights so that the new task can be learned without forgetting past tasks. This approach is different from most continual learning algorithms that typically rely on using replay buffers, weight regularization or task-dependent architectural changes. We demonstrate that our proposed Continual HyperTransformer method equipped with a prototypical loss is capable of learning and retaining knowledge about past tasks for a variety of scenarios, including learning from mini-batches, and task-incremental and class-incremental learning scenarios.

翻译：我们注重学习问题,而不会忘记按顺序完成的多项任务,其中每项任务都是用一些新颖或已经看到的课程来界定的。我们使用最近出版的超变异(HT)来处理这个问题,即基于变异器的超网络,直接从支持组中产生与任务有关的特殊有线电视新闻网加权数。为了从连续的任务序列中学习,我们提议在下一个任务中反复使用产生的加权数作为HT的投入。这样,生成的CNN重量本身就代表了以前学到的任务,而HT受过更新这些加权数的训练,这样就可以在不忘过去的任务的情况下学习新任务。这个方法不同于通常依赖使用重新玩缓冲、重量调整或任务独立的建筑变化的最持续学习算法。我们证明,我们提议的具有原型损失的连续超变异方法能够学习和保留关于过去任务的知识,用于各种情景,包括从小型阵列中学习,以及任务性与阶级学习情景。

相关内容

Continuity

关注 4

让 iOS 8 和 OS X Yosemite 无缝切换的一个新特性。 > Apple products have always been designed to work together beautifully. But now they may really surprise you. With iOS 8 and OS X Yosemite, you’ll be able to do more wonderful things than ever before.

Source: Apple - iOS 8

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

最新《Transformers模型》教程，64页ppt

专知会员服务

321+阅读 · 2020年11月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日