Human beings can recognize new objects with only a few labeled examples, however, few-shot learning remains a challenging problem for machine learning systems. Most previous algorithms in few-shot learning only utilize spatial information of the images. In this paper, we propose to integrate the frequency information into the learning model to boost the discrimination ability of the system. We employ Discrete Cosine Transformation (DCT) to generate the frequency representation, then, integrate the features from both the spatial domain and frequency domain for classification. The proposed strategy and its effectiveness are validated with different backbones, datasets, and algorithms. Extensive experiments demonstrate that the frequency information is complementary to the spatial representations in few-shot classification. The classification accuracy is boosted significantly by integrating features from both the spatial and frequency domains in different few-shot learning tasks.
翻译:人类可以识别新对象,只有几个标签的例子,然而,对机器学习系统来说,少见的学习仍然是一个棘手的问题。以前在少见学习中的算法大多只使用图像的空间信息。在本文件中,我们提议将频率信息纳入学习模式,以提高系统的歧视能力。我们使用分辨孔径转换(DCT)生成频率表达方式,然后将空间域和频率域的特征整合到分类中。拟议战略及其有效性由不同的主干、数据集和算法加以验证。广泛的实验表明,频率信息是对少数分解的空间表达方式的补充。通过将空间和频率领域的特征整合到不同的少见的学习任务中,分类的准确性得到显著提高。