Key challenges in developing generalized automatic emotion recognition systems include scarcity of labeled data and lack of gold-standard references. Even for the cues that are labeled as the same emotion category, the variability of associated expressions can be high depending on the elicitation context e.g., emotion elicited during improvised conversations vs. acted sessions with predefined scripts. In this work, we regard the emotion elicitation approach as domain knowledge, and explore domain transfer learning techniques on emotional utterances collected under different emotion elicitation approaches, particularly with limited labeled target samples. Our emotion recognition model combines the gradient reversal technique with an entropy loss function as well as the softlabel loss, and the experiment results show that domain transfer learning methods can be employed to alleviate the domain mismatch between different elicitation approaches. Our work provides new insights into emotion data collection, particularly the impact of its elicitation strategies, and the importance of domain adaptation in emotion recognition aiming for generalized systems.
翻译:开发通用自动情感识别系统的关键挑战包括标签数据稀缺和缺少金标准参考。即使标签标为同一情感类别的提示,相关表达方式的变异性也取决于诱发环境,例如简易对话与带有预定义脚本的操作会话中产生的情感。在这项工作中,我们认为情感感应法是域知识,探索在不同情感感应方法下收集的情绪发音的域传输学习技术,特别是有有限标签的目标样本。我们的情感识别模型将梯度逆转技术与诱变损失功能和软标签损失结合起来,实验结果显示,可以使用域传输学习方法来缓解不同感应方法之间的域错位。我们的工作为情感数据收集提供了新的洞见,特别是其引导战略的影响,以及域适应对于以通用系统为目标的情感识别的重要性。