Remote photoplethysmography (rPPG) enables non-contact heart rate (HR) estimation from facial videos which gives significant convenience compared with traditional contact-based measurements. In the real-world long-term health monitoring scenario, the distance of the participants and their head movements usually vary by time, resulting in the inaccurate rPPG measurement due to the varying face resolution and complex motion artifacts. Different from the previous rPPG models designed for a constant distance between camera and participants, in this paper, we propose two plug-and-play blocks (i.e., physiological signal feature extraction block (PFE) and temporal face alignment block (TFA)) to alleviate the degradation of changing distance and head motion. On one side, guided with representative-area information, PFE adaptively encodes the arbitrary resolution facial frames to the fixed-resolution facial structure features. On the other side, leveraging the estimated optical flow, TFA is able to counteract the rPPG signal confusion caused by the head movement thus benefit the motion-robust rPPG signal recovery. Besides, we also train the model with a cross-resolution constraint using a two-stream dual-resolution framework, which further helps PFE learn resolution-robust facial rPPG features. Extensive experiments on three benchmark datasets (UBFC-rPPG, COHFACE and PURE) demonstrate the superior performance of the proposed method. One highlight is that with PFE and TFA, the off-the-shelf spatio-temporal rPPG models can predict more robust rPPG signals under both varying face resolution and severe head movement scenarios. The codes are available at https://github.com/LJW-GIT/Arbitrary_Resolution_rPPG.
翻译:在现实世界长期健康监测情景中,参与者的距离及其头部运动通常随时间变化而变化,导致对 RPPG 的测量不准确,因为面部分辨率和运动结构的特征各不相同。与先前为相机和参与者之间保持恒定距离而设计的 RPPG 模型不同,在本文件中,我们提议使用两个插接和播放的心心率估计块(即生理信号特征提取块和时间面部校正校准块(TFA),以缓解移动的距离和头部运动的退化。一方面,以代表性区域信息为指导,PFE 将任意分辨率面部框架与固定分辨率的面部结构特征相适应。在另一方面,利用估计的光学流,TFA能够抵消RPG 由头部移动造成的信号混淆,从而有利于运动-Robet RPG 信号回收。此外,我们还用双向分辨率显示的PPPG-G OFG 软性能模型,通过双流的双向G G 分辨率框架,帮助在双向的MAG 分辨率分辨率分辨率框架中学习双向的MA-G 。