This paper is a technical report to our submission to the ICCV 2021 VIPriors Re-identification Challenge. In order to make full use of the visual inductive priors of the data, we treat the query and gallery images of the same identity as continuous frames in a video sequence. And we propose one novel post-processing strategy for video temporal relationship mining, which not only calculates the distance matrix between query and gallery images, but also the matrix between gallery images. The initial query image is used to retrieve the most similar image from the gallery, then the retrieved image is treated as a new query to retrieve its most similar image from the gallery. By iteratively searching for the closest image, we can achieve accurate image retrieval and finally obtain a robust retrieval sequence.
翻译:本文是我们提交 ICCV 2021 VIPriors 重新识别挑战的技术报告。 为了充分利用数据的视觉感应前缀, 我们将同一身份的查询和画廊图像视为视频序列中的连续框架。 我们提出一个新的视频时间关系开采后处理策略, 不仅计算查询和画廊图像之间的距离矩阵, 而且还计算画廊图像之间的矩阵。 初始查询图像用于从画廊检索最相似的图像, 然后将检索到的图像作为从画廊检索最相似图像的新查询处理。 通过迭接搜索最接近的图像, 我们就可以实现准确的图像检索, 并最终获得一个稳健的检索序列 。