Image-level feature descriptors obtained from convolutional neural networks have shown powerful representation capabilities for image retrieval. In this paper, we present an unsupervised method to aggregate deep convolutional features into compact yet discriminative image vectors by simulating the dynamics of heat diffusion. A distinctive problem in image retrieval is that repetitive or bursty features tend to dominate feature representations, leading to less than ideal matches. We show that by considering each deep feature as a heat source, our method is able to avoiding over-representation of bursty features. We additionally provide a practical solution for the proposed aggregation method, which is further demonstrated in our experimental evaluation. Finally, we extensively evaluate the proposed approach with pre-trained and fine-tuned deep networks on common public benchmarks, and show superior performance compared to previous work.
翻译:从进化神经网络获得的图像级特征描述仪显示,图像检索具有强大的代表能力。在本文中,我们提出一种未经监督的方法,通过模拟热扩散的动态,将深层的进化特征汇总为紧凑但歧视性的图像矢量。图像检索的一个突出问题是,重复性或爆炸性特征往往主导特征表达,导致不理想的匹配。我们表明,通过将每个深层特征视为热源,我们的方法能够避免爆破特征的过度代表性。我们还为拟议的集成方法提供了切实可行的解决办法,我们在实验性评估中进一步展示了这一办法。最后,我们广泛评价了与共同公共基准的预先培训和精确调整的深层网络所拟议的方法,并展示了与以往工作相比的优异性。