Existing person re-identification models often have low generalizability, which is mostly due to limited availability of large-scale labeled data in training. However, labeling large-scale training data is very expensive and time-consuming, while large-scale synthetic dataset shows promising value in learning generalizable person re-identification models. Therefore, in this paper a novel and practical person re-identification task is proposed,i.e. how to use labeled synthetic dataset and unlabeled real-world dataset to train a universal model. In this way, human annotations are no longer required, and it is scalable to large and diverse real-world datasets. To address the task, we introduce a framework with high generalizability, namely DomainMix. Specifically, the proposed method firstly clusters the unlabeled real-world images and selects the reliable clusters. During training, to address the large domain gap between two domains, a domain-invariant feature learning method is proposed, which introduces a new loss,i.e. domain balance loss, to conduct an adversarial learning between domain-invariant feature learning and domain discrimination, and meanwhile learns a discriminative feature for person re-identification. This way, the domain gap between synthetic and real-world data is much reduced, and the learned feature is generalizable thanks to the large-scale and diverse training data. Experimental results show that the proposed annotation-free method is more or less comparable to the counterpart trained with full human annotations, which is quite promising. In addition, it achieves the current state of the art on several person re-identification datasets under direct cross-dataset evaluation.
翻译:现有个人再识别模型往往具有较低的通用性,这主要是因为在培训中大规模贴标签的数据有限。然而,标出大规模培训数据非常昂贵和耗时,而大型合成数据集则显示学习通用人再识别模型的有前景价值。因此,本文件提议了一种创新和实用的人再识别任务,即:如何使用标签合成数据集和未贴标签的真实世界数据集来培训通用模型。这样,就不再需要人的对口说明,而这种对口说明可适用于大型和多样化的现实世界数据集。为完成任务,我们引入了一个具有高度通用性的框架,即DolainMix。具体地说,拟议的方法首先将未贴标签的真实世界图像组合起来,并选择可靠的组群。在培训中,为了解决两个领域之间的大域间差距,提出了一种域内变异特性学习方法,从而带来新的损失,即域间平衡损失,在域内对当前差异和域域间数据群集进行对等学习。为了应对任务,我们引入一个具有高度通用性的框架,我们引入了一个高度通用的、可比较性、可比较性数据,这个模型显示高等级的模型显示高等级数据。