We address the problem of crowd localization, i.e., the prediction of dots corresponding to people in a crowded scene. Due to various challenges, a localization method is prone to spatial semantic errors, i.e., predicting multiple dots within a same person or collapsing multiple dots in a cluttered region. We propose a topological approach targeting these semantic errors. We introduce a topological constraint that teaches the model to reason about the spatial arrangement of dots. To enforce this constraint, we define a persistence loss based on the theory of persistent homology. The loss compares the topographic landscape of the likelihood map and the topology of the ground truth. Topological reasoning improves the quality of the localization algorithm especially near cluttered regions. On multiple public benchmarks, our method outperforms previous localization methods. Additionally, we demonstrate the potential of our method in improving the performance in the crowd counting task.
翻译:我们处理人群本地化问题,即预测与人群拥挤的场景相对应的点。由于各种挑战,本地化方法容易发生空间语义错误,即预测同一人内部的多点,或者在一个杂乱无章的区域摧毁多点。我们建议了针对这些语义错误的地形学方法。我们引入了一种地形学限制,使模型能够理解点的空间安排。为了执行这一限制,我们根据持久性同质学理论定义了持久性损失。损失比较了概率地图的地形景观和地面真相的地形学。地形学推理提高了本地化算法的质量,特别是在交错区域附近。在多个公共基准上,我们的方法优于以前的本地化方法。此外,我们展示了我们改进人群计数任务绩效的方法的潜力。