Image-level feature descriptors obtained from convolutional neural networks have shown powerful representation capabilities for image retrieval. In this paper, we present an unsupervised method to aggregate deep convolutional features into compact yet discriminative image vectors by simulating the dynamics of heat diffusion. A distinctive problem in image retrieval is that repetitive or bursty features tend to dominate feature representations, leading to less than ideal matches. We show that by leveraging elegant properties of the heat equation, our method is able to avoiding over-representation of bursty features. We additionally present a theoretical time complexity analysis showing the efficiency of our method, which is further demonstrated in our experimental evaluation. Finally, we extensively evaluate the proposed approach with pre-trained and fine-tuned deep networks on common public benchmarks, and show superior performance compared to previous work.
翻译:从进化神经网络获得的图像级特征描述仪显示,图像检索具有强大的代表能力。在本文中,我们提出一种未经监督的方法,通过模拟热扩散的动态,将深层的进化特征汇总为紧凑但具有歧视性的图像矢量。图像检索的一个突出问题是,重复性或爆炸性特征往往主导特征表达,导致不理想的匹配。我们表明,通过利用热等式的优雅性能,我们的方法能够避免爆裂特征的过度代表性。我们还提交了一份理论性的时间复杂性分析,表明我们的方法的效率,我们在实验性评估中进一步展示了这一点。最后,我们广泛评价了拟议的方法,在共同公共基准上建立预先培训和经过精细调整的深度网络,并展示了与以往工作相比的优异性。