Metric-based few-shot fine-grained image classification (FSFGIC) aims to learn a transferable feature embedding network by estimating the similarities between query images and support classes from very few examples. In this work, we propose, for the first time, to introduce the non-linear data projection concept into the design of FSFGIC architecture in order to address the limited sample problem in few-shot learning and at the same time to increase the discriminability of the model for fine-grained image classification. Specifically, we first design a feature re-abstraction embedding network that has the ability to not only obtain the required semantic features for effective metric learning but also re-enhance such features with finer details from input images. Then the descriptors of the query images and the support classes are projected into different non-linear spaces in our proposed similarity metric learning network to learn discriminative projection factors. This design can effectively operate in the challenging and restricted condition of a FSFGIC task for making the distance between the samples within the same class smaller and the distance between samples from different classes larger and for reducing the coupling relationship between samples from different categories. Furthermore, a novel similarity measure based on the proposed non-linear data project is presented for evaluating the relationships of feature information between a query image and a support set. It is worth to note that our proposed architecture can be easily embedded into any episodic training mechanisms for end-to-end training from scratch. Extensive experiments on FSFGIC tasks demonstrate the superiority of the proposed methods over the state-of-the-art benchmarks.


翻译:在这项工作中,我们首次提议将非线性数据投影概念引入FSFGIC架构的设计中,以便在少发的学习中解决有限的抽样问题,同时提高微分图像分类模型的可视性。具体地说,我们首先设计一个功能再吸附嵌入网络,不仅能够获得有效计量学习所需的语义特征,而且还能够以输入图像的细细细节重新加强这些特征。然后,将查询图像和支持课程的描述器投放到我们拟议的类似指标学习网络中不同的非线性空间,以学习歧视性预测因素。这种设计可以有效地在FSFGIC的拟议任务中以挑战性和限制性条件运作,使样本在相同等级内进行较小和不同类别样本之间距离的重复性测试方法,从较大类别获得所需的语义性语言学习,以及用输入图像图像的精细细节来重新加强这些特征。 用于评估我们拟议中的“FSFIIGIC”结构图样图样图样图的缩略图和缩略图图图图图图的缩图解图解图解图解图解图解图谱之间的缩缩缩缩缩缩缩图。

0
下载
关闭预览

相关内容

图像分类,顾名思义,是一个输入图像,输出对该图像内容分类的描述的问题。它是计算机视觉的核心,实际应用广泛。
【图与几何深度学习】Graph and geometric deep learning,49页ppt
零样本文本分类,Zero-Shot Learning for Text Classification
专知会员服务
95+阅读 · 2020年5月31日
【清华大学】图随机神经网络,Graph Random Neural Networks
专知会员服务
154+阅读 · 2020年5月26日
内涵网络嵌入:Content-rich Network Embedding
我爱读PAMI
4+阅读 · 2019年11月5日
Hierarchically Structured Meta-learning
CreateAMind
25+阅读 · 2019年5月22日
小样本学习(Few-shot Learning)综述
云栖社区
22+阅读 · 2019年4月6日
Unsupervised Learning via Meta-Learning
CreateAMind
41+阅读 · 2019年1月3日
Hierarchical Imitation - Reinforcement Learning
CreateAMind
19+阅读 · 2018年5月25日
Hierarchical Disentangled Representations
CreateAMind
4+阅读 · 2018年4月15日
条件GAN重大改进!cGANs with Projection Discriminator
CreateAMind
8+阅读 · 2018年2月7日
【推荐】ResNet, AlexNet, VGG, Inception:各种卷积网络架构的理解
机器学习研究会
20+阅读 · 2017年12月17日
Arxiv
7+阅读 · 2020年3月1日
VIP会员
相关资讯
Top
微信扫码咨询专知VIP会员