Estimation of pain intensity from facial expressions captured in videos has an immense potential for health care applications. Given the challenges related to subjective variations of facial expressions, and operational capture conditions, the accuracy of state-of-the-art DL models for recognizing facial expressions may decline. Domain adaptation has been widely explored to alleviate the problem of domain shifts that typically occur between video data captured across various source and target domains. Moreover, given the laborious task of collecting and annotating videos, and subjective bias due to ambiguity among adjacent intensity levels, weakly-supervised learning is gaining attention in such applications. State-of-the-art WSL models are typically formulated as regression problems, and do not leverage the ordinal relationship among pain intensity levels, nor temporal coherence of multiple consecutive frames. This paper introduces a new DL model for weakly-supervised DA with ordinal regression that can be adapted using target domain videos with coarse labels provided on a periodic basis. The WSDA-OR model enforces ordinal relationships among intensity levels assigned to target sequences, and associates multiple relevant frames to sequence-level labels. In particular, it learns discriminant and domain-invariant feature representations by integrating multiple instance learning with deep adversarial DA, where soft Gaussian labels are used to efficiently represent the weak ordinal sequence-level labels from target domain. The proposed approach was validated using RECOLA video dataset as fully-labeled source domain data, and UNBC-McMaster shoulder pain video dataset as weakly-labeled target domain data. We have also validated WSDA-OR on BIOVID and Fatigue datasets for sequence level estimation.
翻译:在视频中拍摄的面部表达式对疼痛强度的估测具有巨大的潜力。鉴于与面部表达式的主观变异和操作性捕捉条件相关的挑战,用于识别面部表达式的最先进的DL模型的准确性可能会下降。已经广泛探索了对域的调整,以缓解在各种源和目标域采集的视频数据之间通常发生的域变迁问题。此外,由于收集和批注视频的艰巨任务,以及相邻强度水平的模糊性造成的主观偏差,在这些应用程序中,微弱监督的学习越来越受到关注。最先进的WSDA-OR模型通常被设计成回归问题,而不会在疼痛强度水平、或多个连续框架的时间一致性之间利用正态关系。本文引入了新的DLAL模型,用于低度的调调调调调调调DADA, 以低温的标签为标准,我们从目标源值序列中执行或惯性关系,并且将多个相关框架与序列级级的Oral-al-al-al-al-lader-ladeal-lader-deal-deal-deal-deal-deal-deal-deal-deal-deal-dealal-deal-deal-deal-deal-deal-deal-lader-deal-deal-deal-deal-deal-deal-deal-deal-deal-deal-deal-deal-deal-deal-deal-deal-deal-deal-deal-deal-deal-deal-deal-deal-al-al-al-al-al-deal-deal-al-al-deal-deal-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-dealdaldald-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-