Crowd counting is a task worth exploring in modern society because of its wide applications such as public safety and video monitoring. Many CNN-based approaches have been proposed to improve the accuracy of estimation, but there are some inherent issues affect the performance, such as overfitting and details lost caused by pooling layers. To tackle these problems, in this paper, we propose an effective network called MDSNet, which introduces a novel supervision framework called Multi-channel Deep Supervision (MDS). The MDS conducts channel-wise supervision on the decoder of the estimation model to help generate the density maps. To obtain the accurate supervision information of different channels, the MDSNet employs an auxiliary network called SupervisionNet (SN) to generate abundant supervision maps based on existing groundtruth. Besides the traditional density map supervision, we also use the SN to convert the dot annotations into continuous supervision information and conduct dot supervision in the MDSNet. Extensive experiments on several mainstream benchmarks show that the proposed MDSNet achieves competitive results and the MDS significantly improves the performance without changing the network structure.
翻译:在现代社会,计票是一项值得探索的任务,因为其应用范围很广,如公共安全和视频监测等。许多有线电视新闻网采用的方法是为了提高估算的准确性。许多有线电视新闻网采用的方法是为了提高估算的准确性,但有一些固有的问题影响到了业绩,例如,由于集合层的过度配置和细节损失。为了解决这些问题,我们在本文件中建议建立一个名为MDSNet的有效网络,这个网络引入一个叫作多通道深度监督(MDS)的新式监督框架。MDS对估算模型的分解器进行有通道的监控,以帮助绘制密度地图。为了获得不同频道的准确监督信息,MDSNet使用一个称为“监督网”(SN)的辅助网络,根据现有的地面图谱绘制大量的监督地图。除了传统的密度地图监督外,我们还利用SNDN将点说明转换为连续的监督信息,并在MDSNet中进行点监督。对几个主流基准进行的广泛实验表明,拟议的MDSNet取得了竞争性的结果,而MDS系统在不改变网络结构的情况下大大改进了业绩。