Person re-identification is a key technology for analyzing video-based human behavior; however, its application is still challenging in practical situations due to the performance degradation for domains different from those in the training data. Here, we propose an environment-constrained adaptive network for reducing the domain gap. This network refines pseudo-labels estimated via a self-training scheme by imposing multi-camera constraints. The proposed method incorporates person-pair information without person identity labels obtained from the environment into the model training. In addition, we develop a method that appropriately selects a person from the pair that contributes to the performance improvement. We evaluate the performance of the network using public and private datasets and confirm the performance surpasses state-of-the-art methods in domains with overlapping camera views. To the best of our knowledge, this is the first study on domain-adaptive learning with multi-camera constraints that can be obtained in real environments.
翻译:个人再识别是分析基于视频的人类行为的关键技术; 但是,由于与培训数据不同领域的性能退化,在实际情况下,其应用仍然具有挑战性。 我们在这里建议建立一个环境限制的适应网络,以减少领域差距。 这个网络通过强制实施多相机限制,通过自我培训计划改进假标签。 拟议的方法将没有从环境中获得的个人身份标签的个人- 配偶信息纳入模式培训。 此外,我们开发了一种方法,从有助于绩效改进的对口中适当选择一个人。 我们使用公共和私人数据集评估网络的性能,并证实其性能超过了在有重叠相机观点的领域采用的最先进方法。 据我们所知,这是关于在真实环境中可以获得的多相机限制的对地适应性学习的首次研究。