Person re-identification (re-id) is a pivotal task within an intelligent surveillance pipeline and there exist numerous re-id frameworks that achieve satisfactory performance in challenging benchmarks. However, these systems struggle to generate acceptable results when there are significant differences between the camera views, illumination conditions, or occlusions. This result can be attributed to the deficiency that exists within many recently proposed re-id pipelines where they are predominately driven by appearance-based features and little attention is paid to other auxiliary information that could aid the re-id. In this paper, we systematically review the current State-Of-The-Art (SOTA) methods in both uni-modal and multimodal person re-id. Extending beyond a conceptual framework, we illustrate how the existing SOTA methods can be extended to support these additional auxiliary information and quantitatively evaluate the utility of such auxiliary feature information, ranging from logos printed on the objects carried by the subject or printed on the clothes worn by the subject, through to his or her behavioural trajectories. To the best of our knowledge, this is the first work that explores the fusion of multiple information to generate a more discriminant person descriptor and the principal aim of this paper is to provide a thorough theoretical analysis regarding the implementation of such a framework. In addition, using model interpretation techniques, we validate the contributions from different combinations of the auxiliary information versus the original features that the SOTA person re-id models extract. We outline the limitations of the proposed approaches and propose future research directions that could be pursued to advance the area of multi-modal person re-id.
翻译:在智能监视管道中,重新定位(重新定位)是一项关键任务,而且有许多在挑战性基准方面达到令人满意的业绩的重新定位框架,然而,这些系统在摄像机观点、照明条件或封闭性之间存在巨大差异的情况下,难以产生可接受的结果,其结果可归因于最近提出的许多重新定位管道中存在的缺陷,这些管道主要是外观特征驱动的,很少注意有助于重新定位的其他辅助信息。在本文件中,我们系统地审查单式和多式联运人员重新定位的当前国家-艺术(SOTA)方法。在概念框架之外,我们说明如何扩大现有的SOTA方法,以支持这些额外的辅助信息,并从数量上评价这种辅助性特征信息的效用,从在主题所携带的物体上打印的标志,或印在主题所穿的衣服上打印的标志,到他或她的行为轨迹。我们最了解的是,这是探索多种信息组合的单一模式和先期期方法。我们探索了未来研究模型的组合,以产生更彻底的图像分析为目的,我们用这种模型来提供更精确的图像分析。