Crowd counting is a challenging task due to the issues such as scale variation and perspective variation in real crowd scenes. In this paper, we propose a novel Cascaded Residual Density Network (CRDNet) in a coarse-to-fine approach to generate the high-quality density map for crowd counting more accurately. (1) We estimate the residual density maps by multi-scale pyramidal features through cascaded residual density modules. It can improve the quality of density map layer by layer effectively. (2) A novel additional local count loss is presented to refine the accuracy of crowd counting, which reduces the errors of pixel-wise Euclidean loss by restricting the number of people in the local crowd areas. Experiments on two public benchmark datasets show that the proposed method achieves effective improvement compared with the state-of-the-art methods.
翻译:人群计数是一项艰巨的任务,因为实际人群场景的规模变异和视角变异等问题。在本文中,我们提议采用粗略到简略的方法建立一个新型的连锁残余密度网络(CRDNet ), 以产生高质量的人口密度图,以便更准确地进行人群计数。 (1) 我们通过级联残余密度模块,用多级金字塔特征来估计残余密度图,通过级联残余密度模块有效地提高密度图层的质量。 (2) 提出新颖的额外本地计数损失,以通过限制当地人群区的人数,来提高人群计数的准确性,从而减少与像素一样的Euclidean损失的误差。关于两个公共基准数据集的实验表明,与最新方法相比,拟议方法取得了有效的改进。