Person search aims at jointly solving Person Detection and Person Re-identification (re-ID). Existing works have designed end-to-end networks based on Faster R-CNN. However, due to the parallel structure of Faster R-CNN, the extracted features come from the low-quality proposals generated by the Region Proposal Network, rather than the detected high-quality bounding boxes. Person search is a fine-grained task and such inferior features will significantly reduce re-ID performance. To address this issue, we propose a Sequential End-to-end Network (SeqNet) to extract superior features. In SeqNet, detection and re-ID are considered as a progressive process and tackled with two sub-networks sequentially. In addition, we design a robust Context Bipartite Graph Matching (CBGM) algorithm to effectively employ context information as an important complementary cue for person matching. Extensive experiments on two widely used person search benchmarks, CUHK-SYSU and PRW, have shown that our method achieves state-of-the-art results. Also, our model runs at 11.5 fps on a single GPU and can be integrated into the existing end-to-end framework easily.
翻译:现有工程设计了以更快R-CNN为基础的端对端网络。然而,由于快速R-CNN的平行结构,所提取的特征来自区域建议网络产生的低质量建议,而不是所检测到的高质量捆绑箱。 人员搜索是一项细微的任务,这种低劣的特征将大大降低重新开发的性能。为了解决这一问题,我们提议建立一个序列端对端网络(SeqNet),以提取优越的特征。在SeqNet中,检测和再开发被视为一个渐进过程,并按顺序由两个子网络处理。此外,我们设计了一种强有力的环境双方图匹配算法,以有效地利用背景信息作为人匹配的重要补充提示。关于两个广泛使用的人搜索基准CUHK-SYSU和PRW的广泛实验表明,我们的方法达到了最新的结果。此外,我们的一个单一GPU的模型在11.5英尺上运行,可以很容易地纳入现有的终端框架。