Image-level feature descriptors obtained from convolutional neural networks have shown powerful representation capabilities for image retrieval. In this paper, we present an unsupervised method to aggregate deep convolutional features into compact yet discriminative image vectors by simulating the dynamics of heat diffusion. A distinctive problem in image retrieval is that repetitive or bursty features tend to dominate feature representations, leading to less than ideal matches. We show that by leveraging elegant properties of the heat equation, our method is able to select informative features while avoiding over-representation of bursty features. We additionally present a theoretical time complexity analysis showing the efficiency of our method, which is further demonstrated in our experimental evaluation. Finally, we extensively evaluate the proposed approach with pre-trained and fine-tuned deep networks on common public benchmarks, and show superior performance compared to previous work.
翻译:从进化神经网络获得的图像级特征描述仪显示,图像检索具有强大的代表能力。在本文中,我们提出一种未经监督的方法,通过模拟热扩散的动态,将深层进化特征汇总为紧凑但具有歧视性的图像矢量。图像检索的一个突出问题是,重复性或爆炸性特征往往主导特征表达,导致不理想的匹配。我们表明,通过利用热等式的优雅性能,我们的方法能够选择信息性能,同时避免爆裂性特征的过多。我们还提交了理论性的时间复杂性分析,显示了我们方法的效率,我们在实验性评估中进一步展示了这一点。最后,我们广泛评价了在共同公共基准上预先培训和经过精细调整的深网络的拟议方法,并展示了与以往工作相比的优异性表现。