Expression of emotions is a crucial part of daily human communication. Emotion recognition in conversations (ERC) is an emerging field of study, where the primary task is to identify the emotion behind each utterance in a conversation. Though a lot of work has been done on ERC in the past, these works only focus on ERC in the English language, thereby ignoring any other languages. In this paper, we present Multilingual MELD (M-MELD), where we extend the Multimodal EmotionLines Dataset (MELD) \cite{poria2018meld} to 4 other languages beyond English, namely Greek, Polish, French, and Spanish. Beyond just establishing strong baselines for all of these 4 languages, we also propose a novel architecture, DiscLSTM, that uses both sequential and conversational discourse context in a conversational dialogue for ERC. Our proposed approach is computationally efficient, can transfer across languages using just a cross-lingual encoder, and achieves better performance than most uni-modal text approaches in the literature on both MELD and M-MELD. We make our data and code publicly on GitHub.
翻译:情感表达是人类日常交流的重要组成部分。对话情感识别(ERC)是一个新兴的研究领域,其主要任务是识别对话中每个话语背后的情感。尽管过去已经有很多关于ERC的研究工作,但这些工作仅关注英语中的ERC,因此忽略了其他语言。在本文中,我们提出了一个名为多语言MELD(M-MELD)的数据集,将多模态EmotionLines数据集(MELD)\cite{poria2018meld}扩展到英语之外的其他4种语言,即希腊语、波兰语、法语和西班牙语。除了针对这4种语言建立强的基线之外,我们还提出了一种新颖的架构DiscLSTM,在对话中使用连续和交互式话语上下文进行ERC。我们的方法在计算效率上非常高,只需使用跨语言编码器即可在各种语言之间进行传输,并且在MELD和M-MELD上的性能均优于文献中大多数单模态文本方法。我们将我们的数据和代码公开于GitHub。