在对话中认识情感事业 (Recognizing Emotion Cause in Conversations)

Soujanya Poria,Navonil Majumder,Devamanyu Hazarika,Deepanway Ghosal,Rishabh Bhardwaj,Samson Yu Bai Jian,Pengfei Hong,Romila Ghosh,Abhinaba Roy,Niyati Chhaya,Alexander Gelbukh,Rada Mihalcea

from arxiv, https://github.com/declare-lab/RECCON, Accepted at Cognitive Computation

We address the problem of recognizing emotion cause in conversations, define two novel sub-tasks of this problem, and provide a corresponding dialogue-level dataset, along with strong Transformer-based baselines. The dataset is available at https://github.com/declare-lab/RECCON. Introduction: Recognizing the cause behind emotions in text is a fundamental yet under-explored area of research in NLP. Advances in this area hold the potential to improve interpretability and performance in affect-based models. Identifying emotion causes at the utterance level in conversations is particularly challenging due to the intermingling dynamics among the interlocutors. Method: We introduce the task of Recognizing Emotion Cause in CONversations with an accompanying dataset named RECCON, containing over 1,000 dialogues and 10,000 utterance cause-effect pairs. Furthermore, we define different cause types based on the source of the causes, and establish strong Transformer-based baselines to address two different sub-tasks on this dataset: causal span extraction and causal emotion entailment. Result: Our Transformer-based baselines, which leverage contextual pre-trained embeddings, such as RoBERTa, outperform the state-of-the-art emotion cause extraction approaches Conclusion: We introduce a new task highly relevant for (explainable) emotion-aware artificial intelligence: recognizing emotion cause in conversations, provide a new highly challenging publicly available dialogue-level dataset for this task, and give strong baseline results on this dataset.

翻译：我们处理在谈话中认识情感原因的问题,界定这一问题的两个新的子任务,并提供相应的对话层面的数据集,同时提供强有力的基于变异器的基线。数据集可在https://github.com/declare-lab/RECCON上查阅。导言:认识到文字中情感原因是一个根本性但探索不足的研究领域。这一领域的进展有可能改善基于影响模型的可解释性和性。在谈话中找出语调层面的情感原因特别具有挑战性,因为对话者之间的动态交错。方法:我们推出在谈话中识别情感原因的任务,同时配有名为RECCON的配套数据集,包含1 000多个对话和10 000对言语因果关系配对。此外,我们根据根源界定了不同的原因类型,并建立了强有力的基于变异源的基线,以解决该数据集上的两个不同的子任务:因地跨度提取和因果情感情感。结果:基于我们的变异状态基线,它利用了背景的牢固嵌入层,如ROBAR-ROCON,为高额的智能分析结果提供高分辨率分析结果。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。