Pedestrian detection in a crowd is a challenging task due to a high number of mutually-occluding human instances, which brings ambiguity and optimization difficulties to the current IoU-based ground truth assignment procedure in classical object detection methods. In this paper, we develop a unique perspective of pedestrian detection as a variational inference problem. We formulate a novel and efficient algorithm for pedestrian detection by modeling the dense proposals as a latent variable while proposing a customized Auto Encoding Variational Bayes (AEVB) algorithm. Through the optimization of our proposed algorithm, a classical detector can be fashioned into a variational pedestrian detector. Experiments conducted on CrowdHuman and CityPersons datasets show that the proposed algorithm serves as an efficient solution to handle the dense pedestrian detection problem for the case of single-stage detectors. Our method can also be flexibly applied to two-stage detectors, achieving notable performance enhancement.
翻译:在人群中,对行人进行突触探测是一项艰巨的任务,因为大量相互存在的人类案例,给古典物体探测方法中目前基于IoU的地面真相分配程序带来模糊和优化的困难。在本文中,我们开发了行人探测的独特视角,将其作为变异推理问题。我们通过将密集的建议建模为潜在变量,为行人探测设计了一种新颖而有效的算法,同时提出一个定制的自动编码变异波段算法。通过优化我们提议的算法,可将古典探测器制成变异行人探测器。对人群和城市人数据集进行的实验显示,拟议的算法是处理单阶段探测器中密集行人探测问题的有效解决方案。我们的方法还可以灵活适用于两阶段探测器,实现显著的性能增强。