We present a lightweight neural model for remote heart rate estimation focused on the efficient spatio-temporal learning of facial photoplethysmography (PPG) based on i) modelling of PPG dynamics by combinations of multiple convolutional derivatives, and ii) increased flexibility of the model to learn possible offsets between the facial video PPG and the ground truth. PPG dynamics are modelled by a Temporal Derivative Module (TDM) constructed by the incremental aggregation of multiple convolutional derivatives, emulating a Taylor series expansion up to the desired order. Robustness to ground truth offsets is handled by the introduction of TALOS (Temporal Adaptive LOcation Shift), a new temporal loss to train learning-based models. We verify the effectiveness of our model by reporting accuracy and efficiency metrics on the public PURE and UBFC-rPPG datasets. Compared to existing models, our approach shows competitive heart rate estimation accuracy with a much lower number of parameters and lower computational cost.
翻译:我们提出了一个用于远程心率估算的轻量级神经模型,其重点是根据多种进化衍生物的组合,对PPG动态进行模型模拟,对面部光膜摄影学(PPG)进行有效的时空学习,以及提高模型的灵活性,以了解面部视频PPG和地面真相之间可能的抵消。PPPG动态模型由一种时间衍生模块(TDM)模拟,该模块由多种转动衍生物的递增聚合所构建,将泰勒系列扩展至预期的顺序。通过采用TALOS(临时适应定位转换)这一培训学习模型的新时间损失,对地面真相进行强力抵消。我们通过报告公共PURE和UBFC-rPPG数据集的准确性和效率指标,核查我们的模型的有效性。与现有模型相比,我们的方法显示了具有竞争力的心率估算精度,参数数量要低得多,计算成本要低。