The crucial problem in vehicle re-identification is to find the same vehicle identity when reviewing this object from cross-view cameras, which sets a higher demand for learning viewpoint-invariant representations. In this paper, we propose to solve this problem from two aspects: constructing robust feature representations and proposing camera-sensitive evaluations. We first propose a novel Heterogeneous Relational Complement Network (HRCN) by incorporating region-specific features and cross-level features as complements for the original high-level output. Considering the distributional differences and semantic misalignment, we propose graph-based relation modules to embed these heterogeneous features into one unified high-dimensional space. On the other hand, considering the deficiencies of cross-camera evaluations in existing measures (i.e., CMC and AP), we then propose a Cross-camera Generalization Measure (CGM) to improve the evaluations by introducing position-sensitivity and cross-camera generalization penalties. We further construct a new benchmark of existing models with our proposed CGM and experimental results reveal that our proposed HRCN model achieves new state-of-the-art in VeRi-776, VehicleID, and VERI-Wild.
翻译:车辆再识别的关键问题是,在审查交叉视像相机中的该物体时找到相同的车辆身份,因为交叉视像相机对学习观点和差异表示的要求更高。在本文件中,我们提议从两个方面解决这一问题:建立强有力的特征表现和提议对相机敏感的评价。我们首先提出一个新的异种关系互补网络(HRCN),将特定区域的特征和跨层次特征作为原始高层次产出的补充。考虑到分布差异和语义失调,我们提议基于图表的关系模块,将这些差异特征嵌入一个统一的高维空间。另一方面,考虑到现有措施(即CMC和AP)中的交叉相机评价的缺陷,我们然后提议一个跨镜头通用措施(CGM),通过引入对位置的敏感度和交叉摄像一般化处罚来改进评估。我们进一步根据我们提议的CGM和实验结果,为现有模型建立一个新的基准,表明我们提议的HR模型在VeRi-776、MISID和VERIWIWAL中实现了新的状态。