Deep networks can learn to accurately recognize objects of a category by training on a large number of annotated images. However, a meta-learning challenge known as a low-shot image recognition task comes when only a few images with annotations are available for learning a recognition model for one category. The objects in testing/query and training/support images are likely to be different in size, location, style, and so on. Our method, called Cascaded Feature Matching Network (CFMN), is proposed to solve this problem. We train the meta-learner to learn a more fine-grained and adaptive deep distance metric by focusing more on the features that have high correlations between compared images by the feature matching block which can align associated features together and naturally ignore those non-discriminative features. By applying the proposed feature matching block in different layers of the few-shot recognition network, multi-scale information among the compared images can be incorporated into the final cascaded matching feature, which boosts the recognition performance further and generalizes better by learning on relationships. The experiments for few-shot learning on two standard datasets, \emph{mini}ImageNet and Omniglot, have confirmed the effectiveness of our method. Besides, the multi-label few-shot task is first studied on a new data split of COCO which further shows the superiority of the proposed feature matching network when performing few-shot learning in complex images. The code will be made publicly available.
翻译:深网络可以通过培训大量附加说明的图像来学习准确识别某类的物体。 然而,当只有少量带有注释的图像可用于学习某类的识别模型时,就会出现被称为低光图像识别任务的元化学习挑战。测试/询问和培训/支持图像中的对象在大小、位置、风格等方面可能不同。我们称为连锁地物匹配网络(CFMN)的方法可以解决这一问题。我们培训元数据学习元数据以学习更精细的和适应性更深距离的衡量标准,通过更多关注功能匹配区块的图像之间具有高度相关性的特征,这些特征可以将相关特征相匹配,并自然忽略这些非差异性特征。通过在少光谱识别网络的不同层应用拟议的特征匹配区块,比较图像中的多尺度信息将被纳入最终的级联匹配功能中,通过学习关系来进一步提升认识性,从而更好地进行更精细的深距离测量度度度度度度度测量,方法是通过功能匹配区块块比较区块来更精确地学习两个标准数据集,这些相比图像的相比对图像具有高度关联性, CO=高级网络的分数,在研究后, 任务的精度将演示的精度为精度为精度。