通过多族裔邻居关系和双交叉移动进行精细图像分类 (Few-shot Fine-grained Image Classification via Multi-Frequency Neighborhood and Double-cross Modulation)

Traditional fine-grained image classification typically relies on large-scale training samples with annotated ground-truth. However, some sub-categories have few available samples in real-world applications, and current few-shot models still have difficulty in distinguishing subtle differences among fine-grained categories. To solve this challenge, we propose a novel few-shot fine-grained image classification network (FicNet) using multi-frequency neighborhood (MFN) and double-cross modulation (DCM). MFN focuses on both spatial domain and frequency domain to capture multi-frequency structural representations, which reduces the influence of appearance and background changes to the intra-class distance. DCM consists of bi-crisscross component and double 3D cross-attention component. It modulates the representations by considering global context information and inter-class relationship respectively, which enables the support and query samples respond to the same parts and accurately identify the subtle inter-class differences. The comprehensive experiments on three fine-grained benchmark datasets for two few-shot tasks verify that FicNet has excellent performance compared to the state-of-the-art methods. Especially, the experiments on two datasets, "Caltech-UCSD Birds" and "Stanford Cars", can obtain classification accuracy 93.17\% and 95.36\%, respectively. They are even higher than that the general fine-grained image classification methods can achieve.

翻译：传统微细微图像分类通常依赖于具有附加说明的地面真实性的大型培训样本。然而,一些子类在现实世界应用中很少有可用的样本,而目前的微粒模型仍然难以区分细细微分类类别之间的细微差异。为了解决这一挑战,我们建议采用多频邻和双向调制(DCM),建立一个新颖的微小细微微图像分类网络(FicNet),使支持和查询样本能够对同一部分作出反应,并准确地识别微妙的等级差异。最惠国侧重于空间域和频率域,以捕捉多频结构显示,这减少了表象和对类内距离的背景变化的影响。DCM由双镜交叉部分和双三维交叉注意部分组成。DCM通过分别考虑全球背景信息和阶级间关系来调整这些表述,使支持和查询样本能够对同一部分作出反应,并准确地识别微妙的等级差异。最惠国在三个微细微的基底基准数据集上进行了全面实验,用于两项微小的几分位任务,以核实FicNet的性能比州-更高图像的图像和背景分析质量,甚至达到州-格拉氏-C的准确度方法。”,卡纳-C可以分别进行“获得“C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C-C