Re-ranking utilizes contextual information to optimize the initial ranking list of person or vehicle re-identification (re-ID), which boosts the retrieval performance at post-processing steps. This paper proposes a re-ranking network to predict the correlations between the probe and top-ranked neighbor samples. Specifically, all the feature embeddings of query and gallery images are expanded and enhanced by a linear combination of their neighbors, with the correlation prediction serving as discriminative combination weights. The combination process is equivalent to moving independent embeddings toward the identity centers, improving cluster compactness. For correlation prediction, we first aggregate the contextual information for probe's k-nearest neighbors via the Transformer encoder. Then, we distill and refine the probe-related features into the Contextual Memory cell via attention mechanism. Like humans that retrieve images by not only considering probe images but also memorizing the retrieved ones, the Contextual Memory produces multi-view descriptions for each instance. Finally, the neighbors are reconstructed with features fetched from the Contextual Memory, and a binary classifier predicts their correlations with the probe. Experiments on six widely-used person and vehicle re-ID benchmarks demonstrate the effectiveness of the proposed method. Especially, our method surpasses the state-of-the-art re-ranking approaches on large-scale datasets by a significant margin, i.e., with an average 4.83% CMC@1 and 14.83% mAP improvements on VERI-Wild, MSMT17, and VehicleID datasets.
翻译:重新排序利用背景信息优化最初的个人或车辆再识别排名列表(re-ID),这提高了后处理步骤的检索性能。本文建议重新排序网络,以预测探测器和最上层邻居样本之间的相互关系。具体地说,查询和画廊图像的所有嵌入功能都通过邻居的线性组合而扩大和增强,相关预测是歧视性的组合权重。合并过程相当于将独立嵌入身份中心、改进集群紧凑度。对于相关预测,我们首先通过变换器编码器将探测器最近邻的相邻方的背景信息汇总起来。然后,我们通过注意机制将与探测有关的特征提炼和精炼到内存细胞中去。正如通过不仅考虑探测图像,而且对检索图像进行记忆的线性组合一样,背景记忆为每个实例制作了多视图描述。最后,邻居们通过从背景记忆中获取的特征来重建,并且用一个二元分级分类器预测其与探测器的关联性关系。然后,我们通过注意机制将探测器的探测相关特征特性相关特性相关特征进行筛选。 4. 在六个广泛使用的人中进行实验,以大规模的数据模型重新标定方法和车辆升级。