使用t分发手段确认言论情感识别的标签不确定性建模和预测 (Label Uncertainty Modeling and Prediction for Speech Emotion Recognition using t-Distributions)

As different people perceive others' emotional expressions differently, their annotation in terms of arousal and valence are per se subjective. To address this, these emotion annotations are typically collected by multiple annotators and averaged across annotators in order to obtain labels for arousal and valence. However, besides the average, also the uncertainty of a label is of interest, and should also be modeled and predicted for automatic emotion recognition. In the literature, for simplicity, label uncertainty modeling is commonly approached with a Gaussian assumption on the collected annotations. However, as the number of annotators is typically rather small due to resource constraints, we argue that the Gaussian approach is a rather crude assumption. In contrast, in this work we propose to model the label distribution using a Student's t-distribution which allows us to account for the number of annotations available. With this model, we derive the corresponding Kullback-Leibler divergence based loss function and use it to train an estimator for the distribution of emotion labels, from which the mean and uncertainty can be inferred. Through qualitative and quantitative analysis, we show the benefits of the t-distribution over a Gaussian distribution. We validate our proposed method on the AVEC'16 dataset. Results reveal that our t-distribution based approach improves over the Gaussian approach with state-of-the-art uncertainty modeling results in speech-based emotion recognition, along with an optimal and even faster convergence.

翻译：不同的人对他人的情感表达方式有不同的看法,不同的人对他人的情感表达方式有不同的看法,因此,他们用振奋和价值的描述本身是主观的。为了解决这个问题,这些情感说明通常由多个注解者收集,并在注解者中平均收集,以获得振奋和价值的标签。然而,除了一般情况之外,标签的不确定性也是值得注意的,并且应当建模和预测,以便自动认识情绪。在文献中,为了简单起见,标签的不确定性模型通常与所收集的注解者的假设进行对比。然而,由于资源限制,注解者的数量通常相当少,因此我们认为,高估方法是一个相当粗略的假设。与此相反,我们建议用学生的图解说来模拟标签的分布方式,从而使我们能够对可用的图解数量进行计算。我们用基于 Kullback-Lever 模型计算出相应的基于损失的模型功能,并用它来训练一个基于情绪标签分配的估算师,从中可以推断出平均和不确定性的方法。通过定性和定量分析,我们用定量分析,我们用高估方法来验证了我们提出的高估的压结果,我们根据高估的成绩的图表的判分化方法,我们的数据。

相关内容

MoDELS

关注 0

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日