具有经常内存的常务常务普通话转换器 (Paragraph-level Commonsense Transformers with Recurrent Memory)

Human understanding of narrative texts requires making commonsense inferences beyond what is stated explicitly in the text. A recent model, COMET, can generate such implicit commonsense inferences along several dimensions such as pre- and post-conditions, motivations, and mental states of the participants. However, COMET was trained on commonsense inferences of short phrases, and is therefore discourse-agnostic. When presented with each sentence of a multi-sentence narrative, it might generate inferences that are inconsistent with the rest of the narrative. We present the task of discourse-aware commonsense inference. Given a sentence within a narrative, the goal is to generate commonsense inferences along predefined dimensions, while maintaining coherence with the rest of the narrative. Such large-scale paragraph-level annotation is hard to get and costly, so we use available sentence-level annotations to efficiently and automatically construct a distantly supervised corpus. Using this corpus, we train PARA-COMET, a discourse-aware model that incorporates paragraph-level information to generate coherent commonsense inferences from narratives. PARA-COMET captures both semantic knowledge pertaining to prior world knowledge, and episodic knowledge involving how current events relate to prior and future events in a narrative. Our results show that PARA-COMET outperforms the sentence-level baselines, particularly in generating inferences that are both coherent and novel.

翻译：人类对叙述性案文的理解要求作出超出案文中明确表述内容的常识推理; 最近的一个模型,即知识与技术伦理学,可以在参与者的先期和后期条件、动机和精神状态等几个方面产生这种隐含的常识推理; 然而,知识与技术伦理学,在短句的常识推理方面受过培训,因此是讨论的不可知性; 当与多语种叙述的每句话一起提出时,它可能会产生与叙述性其余部分不一致的推论; 我们介绍讨论性常识推理的任务; 在叙述性的一个句子中,目标是在预先界定的层面产生常识推理,同时保持与叙述性其余部分的一致性; 然而,知识水平如此大的段落级注解很难获得且成本高昂,因此我们利用现有的句级说明,以高效和自动地构建一个远处监管性的内容; 我们利用这一材料,我们培训泛美网络,一个具有讨论性的模型,将段落级信息纳入一致的常识解论,从我们先前的叙述性知识中产生一致的常识,从我们以往的基线和历史记录中显示我们未来的了解。