In this work, we focus on dialogue reading comprehension (DRC), a task extracting answer spans for questions from dialogues. Dialogue context modeling in DRC is tricky due to complex speaker information and noisy dialogue context. To solve the two problems, previous research proposes two self-supervised tasks respectively: guessing who a randomly masked speaker is according to the dialogue and predicting which utterance in the dialogue contains the answer. Although these tasks are effective, there are still urging problems: (1) randomly masking speakers regardless of the question cannot map the speaker mentioned in the question to the corresponding speaker in the dialogue, and ignores the speaker-centric nature of utterances. This leads to wrong answer extraction from utterances in unrelated interlocutors' scopes; (2) the single utterance prediction, preferring utterances similar to the question, is limited in finding answer-contained utterances not similar to the question. To alleviate these problems, we first propose a new key utterances extracting method. It performs prediction on the unit formed by several contiguous utterances, which can realize more answer-contained utterances. Based on utterances in the extracted units, we then propose Question-Interlocutor Scope Realized Graph (QuISG) modeling. As a graph constructed on the text of utterances, QuISG additionally involves the question and question-mentioning speaker names as nodes. To realize interlocutor scopes, speakers in the dialogue are connected with the words in their corresponding utterances. Experiments on the benchmarks show that our method can achieve better and competitive results against previous works.
翻译:在这项工作中,我们侧重于对话阅读理解(DRC),这是为对话提问提取答案的任务。刚果民主共和国的对话背景模型由于复杂的演讲者信息和吵闹的对话背景而变得棘手。为了解决这两个问题,先前的研究分别提出两项自我监督的任务:猜测谁是一个随机遮蔽的演讲者,并预测对话中的言论包含答案。虽然这些任务是有效的,但仍然有敦促的问题:(1) 随机地掩盖演讲者,而不管问题是什么,无法将问题中提到的演讲者映射到对话中的相应演讲者身上,而忽略了演讲的语调中心性质。这导致从无关的对话者范围内的演讲中错误地解答;(2) 单一的说性预测,倾向于与问题相似的说词,在寻找与问题相似的回答内容时受到限制。为了缓解这些问题,我们首先提出一个新的关键语调解解方法。它用几个连结的演讲者组成一个单位,可以实现更准确的言语调,而忽略了话语调的本质。基于提取的语调调调,我们提出“Gloveal Queral ” 和“Gal-Lisal-I”的直径,我们用了“Lism-Lisal-Lisal-Lisal-I”中的直判法,我们用了“Simal-I 的直径化的直径”的标取了“Lisal-Lisal-Lisal-I-I-I-I-I-I-Lisal-Lisal-I-I-I-I-Lisal-I-I-I-Lisal-Lisal-Lisal-I-I-I-I-I-I-Lisal-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-