With the progress of sensor technology in wearables, the collection and analysis of PPG signals are gaining more interest. Using Machine Learning, the cardiac rhythm corresponding to PPG signals can be used to predict different tasks such as activity recognition, sleep stage detection, or more general health status. However, supervised learning is often limited by the amount of available labeled data, which is typically expensive to obtain. To address this problem, we propose a Self-Supervised Learning (SSL) method with a pretext task of signal reconstruction to learn an informative generalized PPG representation. The performance of the proposed SSL framework is compared with two fully supervised baselines. The results show that in a very limited label data setting (10 samples per class or less), using SSL is beneficial, and a simple classifier trained on SSL-learned representations outperforms fully supervised deep neural networks. However, the results reveal that the SSL-learned representations are too focused on encoding the subjects. Unfortunately, there is high inter-subject variability in the SSL-learned representations, which makes working with this data more challenging when labeled data is scarce. The high inter-subject variability suggests that there is still room for improvements in learning representations. In general, the results suggest that SSL may pave the way for the broader use of machine learning models on PPG data in label-scarce regimes.
翻译:利用机器学习,与PPG信号相对应的心脏节奏可以用来预测不同的任务,例如活动识别、睡眠阶段检测或一般健康状况;然而,监督学习往往受到现有标签数据数量的限制,这些数据通常昂贵,要解决这一问题,我们建议采用自我监督学习方法,以信号重建为借口,学习信息丰富的通用PPG代表制。拟议的SSL框架的绩效与两个完全监督的基线相比较。结果显示,在非常有限的标签数据设置(每类10个或更少的样本)中,使用SSL(SL)是有用的,而一个简单的叙级人员在SSL(S)的学习表现方面受过培训,这比完全监督的深层神经网络要好。然而,结果显示,SSL(SSL)的学习表现过于侧重于对主题进行编码。不幸的是,SSL(SL)所学的表述中存在很高的跨主题差异,在标签数据缺乏时,与这一数据相比更具挑战性。高的跨主题变量表明,在SL(SL)模型中,仍然可以更广泛地学习SL(SL)模型。