Person re-identification is a challenging task because of the high intra-class variance induced by the unrestricted nuisance factors of variations such as pose, illumination, viewpoint, background, and sensor noise. Recent approaches postulate that powerful architectures have the capacity to learn feature representations invariant to nuisance factors, by training them with losses that minimize intra-class variance and maximize inter-class separation, without modeling nuisance factors explicitly. The dominant approaches use either a discriminative loss with margin, like the softmax loss with the additive angular margin, or a metric learning loss, like the triplet loss with batch hard mining of triplets. Since the softmax imposes feature normalization, it limits the gradient flow supervising the feature embedding. We address this by joining the losses and leveraging the triplet loss as a proxy for the missing gradients. We further improve invariance to nuisance factors by adding the discriminative task of predicting attributes. Our extensive evaluation highlights that when only a holistic representation is learned, we consistently outperform the state-of-the-art on the three most challenging datasets. Such representations are easier to deploy in practical systems. Finally, we found that joining the losses removes the requirement for having a margin in the softmax loss while increasing performance.
翻译:重新确定身份是一项具有挑战性的任务,因为各种变异因素,如成因、照明、观点、背景和感官噪音等无限制的干扰因素,造成了高阶级内部差异。最近的一些方法假设,强势建筑有能力学习与扰动因素不同的特征表现,通过培训他们遭受损失,最大限度地减少阶级内部差异,最大限度地实现阶级间分离,而没有明确的建模干扰因素。主导方法使用带有差幅的歧视性损失,如与角边距相加的软峰值损失,或计量学习损失,如分批硬采矿三重体的三重损失。自软式马克思实行特征正常化以来,它限制了控制特性嵌入的梯度流动。我们通过合并损失并利用三重损失作为缺失梯度的代名来解决这一问题。我们通过增加预测属性的歧视性任务,进一步提高对扰动因素的不稳定性。我们的广泛评价强调,只要了解整体代表性,我们就会不断超越三个最具挑战性的数据集的状态。我们发现,在增加软性损失的系统中更容易地部署。