In intelligent building management, knowing the number of people and their location in a room are important for better control of its illumination, ventilation, and heating with reduced costs and improved comfort. This is typically achieved by detecting people using compact embedded devices that are installed on the room's ceiling, and that integrate low-resolution infrared camera, which conceals each person's identity. However, for accurate detection, state-of-the-art deep learning models still require supervised training using a large annotated dataset of images. In this paper, we investigate cost-effective methods that are suitable for person detection based on low-resolution infrared images. Results indicate that for such images, we can reduce the amount of supervision and computation, while still achieving a high level of detection accuracy. Going from single-shot detectors that require bounding box annotations of each person in an image, to auto-encoders that only rely on unlabelled images that do not contain people, allows for considerable savings in terms of annotation costs, and for models with lower computational costs. We validate these experimental findings on two challenging top-view datasets with low-resolution infrared images.
翻译:在智能建筑管理中,了解人的数量及其在室内的位置对于更好地控制其照明、通风和取暖非常重要,而且费用降低,舒适度提高。这通常是通过使用安装在室内天花板上的紧固嵌嵌入装置,并结合低分辨率红外摄像头,从而隐藏每个人的身份,来检测人们。然而,为了准确检测,最先进的深层学习模式仍然需要使用大量附加注释的图像数据集进行监管培训。在本文中,我们调查适合根据低分辨率红外图像进行个人检测的成本效益高的方法。结果显示,对于这些图像,我们可以减少监督和计算的数量,同时仍能达到高度的检测准确度。从需要每个图像中的每个人带框说明的单发探测器,到仅依赖不包含人的未贴标签图像的自动编码器,可以节省大量注解费用和较低计算成本的模型。我们用低分辨率红红外图像这两个具有挑战性的顶视数据集验证了这些实验结果。