Few-shot learning (FSL), which aims to recognise new classes by adapting the learned knowledge with extremely limited few-shot (support) examples, remains an important open problem in computer vision. Most of the existing methods for feature alignment in few-shot learning only consider image-level or spatial-level alignment while omitting the channel disparity. Our insight is that these methods would lead to poor adaptation with redundant matching, and leveraging channel-wise adjustment is the key to well adapting the learned knowledge to new classes. Therefore, in this paper, we propose to learn a dynamic alignment, which can effectively highlight both query regions and channels according to different local support information. Specifically, this is achieved by first dynamically sampling the neighbourhood of the feature position conditioned on the input few shot, based on which we further predict a both position-dependent and channel-dependent Dynamic Meta-filter. The filter is used to align the query feature with position-specific and channel-specific knowledge. Moreover, we adopt Neural Ordinary Differential Equation (ODE) to enable a more accurate control of the alignment. In such a sense our model is able to better capture fine-grained semantic context of the few-shot example and thus facilitates dynamical knowledge adaptation for few-shot learning. The resulting framework establishes the new state-of-the-arts on major few-shot visual recognition benchmarks, including miniImageNet and tieredImageNet.
翻译:少见的学习(FSL)旨在通过以极有限的少见(支持)实例调整所学知识来认识新课程,这仍然是计算机视觉中一个重要的开放问题。在少见学习中,现有的功能调整方法大多只考虑图像水平或空间水平的对齐,而忽略频道差异。我们的见解是,这些方法会导致适应性差,产生冗余匹配,而利用频道的调整是使所学知识适应新课程的关键。因此,在本文件中,我们提议学习动态对齐,根据不同的当地支持信息,有效地突出查询区域和渠道。具体地说,这是通过首先对以少量投入为条件的功能位置位置位置的周边进行动态抽样取样来实现的,在此基础上,我们进一步预测一个既依赖位置又依赖频道动态动态的动态元化过滤器。过滤器用来使查询功能与位置特定和频道特定知识相匹配。此外,我们采用了神经普通差异度(ODE),以便能够更准确地控制调整。从这个意义上说,我们的模型能够更好地捕捉到精准的查询区域和渠道。我们模型能够更好地采集以少数输入的图像网格定位位置位置,从而确定主要的图像化框架,从而建立动态的动态的图像化基本认识。