提高元学习中普遍化和记忆效率的动态核心选择 (Dynamic Kernel Selection for Improved Generalization and Memory Efficiency in Meta-learning)

Gradient based meta-learning methods are prone to overfit on the meta-training set, and this behaviour is more prominent with large and complex networks. Moreover, large networks restrict the application of meta-learning models on low-power edge devices. While choosing smaller networks avoid these issues to a certain extent, it affects the overall generalization leading to reduced performance. Clearly, there is an approximately optimal choice of network architecture that is best suited for every meta-learning problem, however, identifying it beforehand is not straightforward. In this paper, we present MetaDOCK, a task-specific dynamic kernel selection strategy for designing compressed CNN models that generalize well on unseen tasks in meta-learning. Our method is based on the hypothesis that for a given set of similar tasks, not all kernels of the network are needed by each individual task. Rather, each task uses only a fraction of the kernels, and the selection of the kernels per task can be learnt dynamically as a part of the inner update steps. MetaDOCK compresses the meta-model as well as the task-specific inner models, thus providing significant reduction in model size for each task, and through constraining the number of active kernels for every task, it implicitly mitigates the issue of meta-overfitting. We show that for the same inference budget, pruned versions of large CNN models obtained using our approach consistently outperform the conventional choices of CNN models. MetaDOCK couples well with popular meta-learning approaches such as iMAML. The efficacy of our method is validated on CIFAR-fs and mini-ImageNet datasets, and we have observed that our approach can provide improvements in model accuracy of up to 2% on standard meta-learning benchmark, while reducing the model size by more than 75%.

翻译：基于梯度的元化学习方法往往过于适合元培训集,而这种行为在大型和复杂的网络中更为突出。此外,大型网络限制对低功率边缘设备应用元学习模型。选择较小的网络在某种程度上避免了这些问题,但会影响总体的概括性,导致性能下降。显然,对于每个元学习问题都有一个最适合于每个元培训集的网络架构选择,但事先确定它并非简单易行。在本文中,我们介绍Metadock,这是设计压缩CNN模型的动态动态内核选择战略,在元学习中广泛推广隐性任务。我们的方法基于一个假设,即对于一组类似的任务,并非每个单个任务都需要网络的所有核心,它会在某种程度上避免这些问题。相反,每项任务只使用部分的内核,而每项任务选取的内核核心内核,作为内部更新步骤的一部分,可以动态地学习。MetDOCock在元模型和特定任务内部模型上进行快速的改进,从而大大降低模式的准确性,因此,我们每个常规任务中每个任务规模的模型都要进行大幅缩小。