We propose a scheme for supervised image classification that uses privileged information, in the form of keypoint annotations for the training data, to learn strong models from small and/or biased training sets. Our main motivation is the recognition of animal species for ecological applications such as biodiversity modelling, which is challenging because of long-tailed species distributions due to rare species, and strong dataset biases such as repetitive scene background in camera traps. To counteract these challenges, we propose a visual attention mechanism that is supervised via keypoint annotations that highlight important object parts. This privileged information, implemented as a novel privileged pooling operation, is only required during training and helps the model to focus on regions that are discriminative. In experiments with three different animal species datasets, we show that deep networks with privileged pooling can use small training sets more efficiently and generalize better.
翻译:我们提出一个监督图像分类计划,以培训数据关键点说明的形式使用特许信息,从小型和/或有偏向的培训中学习强有力的模型,我们的主要动机是承认动物物种在生物多样性模型等生态应用中具有挑战性,因为由于稀有物种的长尾分布,生物多样性模型等生态应用具有挑战性,以及诸如相机陷阱中重复的场景背景等强烈的数据集偏差。为了应对这些挑战,我们提出一个视觉关注机制,通过强调重要对象部分的关键点说明加以监督。这一特权信息作为新型的特权集合行动实施,只在培训期间才需要,并帮助该模型侧重于具有歧视性的区域。在三个不同的动物物种数据集的实验中,我们显示拥有特权集合的深层网络可以更高效、更普遍地使用小型的成套培训。