Advances in computer vision and machine learning techniques have led to significant development in 2D and 3D human pose estimation from RGB cameras, LiDAR, and radars. However, human pose estimation from images is adversely affected by occlusion and lighting, which are common in many scenarios of interest. Radar and LiDAR technologies, on the other hand, need specialized hardware that is expensive and power-intensive. Furthermore, placing these sensors in non-public areas raises significant privacy concerns. To address these limitations, recent research has explored the use of WiFi antennas (1D sensors) for body segmentation and key-point body detection. This paper further expands on the use of the WiFi signal in combination with deep learning architectures, commonly used in computer vision, to estimate dense human pose correspondence. We developed a deep neural network that maps the phase and amplitude of WiFi signals to UV coordinates within 24 human regions. The results of the study reveal that our model can estimate the dense pose of multiple subjects, with comparable performance to image-based approaches, by utilizing WiFi signals as the only input. This paves the way for low-cost, broadly accessible, and privacy-preserving algorithms for human sensing.
翻译:计算机视觉和机器学习技术的进步导致对2D和3D人的2D和3D人造图象根据RGB摄像机、LIDAR和雷达进行重大估计,然而,图像对人造图象的估计受到许多感兴趣情景中常见的封闭和照明的不利影响。雷达和LIDAR技术需要昂贵和电力密集的专门硬件。此外,将这些传感器放在非公共领域引起重大的隐私问题。为解决这些限制,最近的研究探索了如何使用无线Fi天线(1D传感器)进行身体分解和关键物体探测。本文进一步扩展了WiFi信号的使用,并结合计算机视觉中常用的深层学习结构来估计密集的人造相通信。我们开发了一个深度的神经网络,绘制了WiFi信号在24个人类区域坐标上的阶段和振荡。研究结果显示,我们的模型可以估计多种主题的密度,其性能与基于图像的方法相当,使用WiFi信号作为唯一的投入。这为低成本、可广泛获取的和保密的人类算法铺设了道路铺设了道路。