We exploit field guides to learn bird species recognition, in particular zero-shot recognition of unseen species. The illustrations contained in field guides deliberately focus on discriminative properties of a species, and can serve as side information to transfer knowledge from seen to unseen classes. We study two approaches: (1) a contrastive encoding of illustrations that can be fed into zero-shot learning schemes; and (2) a novel method that leverages the fact that illustrations are also images and as such structurally more similar to photographs than other kinds of side information. Our results show that illustrations from field guides, which are readily available for a wide range of species, are indeed a competitive source of side information. On the iNaturalist2021 subset, we obtain a harmonic mean from 749 seen and 739 unseen classes greater than $45\%$ (@top-10) and $15\%$ (@top-1). Which shows that field guides are a valuable option for challenging real-world scenarios with many species.
翻译:我们利用实地指南来学习鸟类物种的识别,尤其是对看不见物种的零光识别。实地指南中的插图有意侧重于物种的歧视性特性,并且可以作为将知识从可见的类别转移到不可见的类别。我们研究了两种方法:(1)对插图的对比编码,可以输入零光学习计划;(2)利用插图也是图像这一事实的新颖方法,在结构上比其他类型的侧信息更相似。我们的结果表明,实地指南中的插图确实是一个竞争的侧面信息来源。在iNaturallist2021子集中,我们从749个可见的和739个可见的类别获得了一个协调的平均值,大于45美元(@top-10)和15美元(@top-1),这表明实地指南是挑战许多物种真实世界情景的一个宝贵选择。