Detecting persons in images or video with neural networks is a well-studied subject in literature. However, such works usually assume the availability of a camera of decent resolution and a high-performance processor or GPU to run the detection algorithm, which significantly increases the cost of a complete detection system. However, many applications require low-cost solutions, composed of cheap sensors and simple microcontrollers. In this paper, we demonstrate that even on such hardware we are not condemned to simple classic image processing techniques. We propose a novel ultra-lightweight CNN-based person detector that processes thermal video from a low-cost 32x24 pixel static imager. Trained and compressed on our own recorded dataset, our model achieves up to 91.62% accuracy (F1-score), has less than 10k parameters, and runs as fast as 87ms and 46ms on low-cost microcontrollers STM32F407 and STM32F746, respectively.
翻译:在神经网络的图像或视频中检测人员是文献中研究周密的主题,然而,这类工作通常假定有一台体面分辨率的照相机和一个高性能处理器或GPU来操作探测算法,这大大增加了整个探测系统的成本;然而,许多应用需要低成本的解决方案,由廉价传感器和简单的微控制器组成;在本文中,我们证明,即使在这些硬件上,我们也不必采用简单的经典图像处理技术;我们提议建立一个新型超轻量CNN人探测器,从一个低成本的32x24像素静态成像仪中处理热视频。在我们自己的记录数据集中培训和压缩,我们的模型达到91.62%的精度(F1芯),有不到10公里的参数,在低成本微控制器STM32F407和STM32F7466上运行速度高达87米和46米。