Understanding causality is key to the success of NLP applications, especially in high-stakes domains. Causality comes in various perspectives such as enable and prevent that, despite their importance, have been largely ignored in the literature. This paper introduces a novel fine-grained causal reasoning dataset and presents a series of novel predictive tasks in NLP, such as causality detection, event causality extraction, and Causal QA. Our dataset contains human annotations of 25K cause-effect event pairs and 24K question-answering pairs within multi-sentence samples, where each can have multiple causal relationships. Through extensive experiments and analysis, we show that the complex relations in our dataset bring unique challenges to state-of-the-art methods across all three tasks and highlight potential research opportunities, especially in developing "causal-thinking" methods.
翻译:理解因果关系是NLP应用成功的关键,特别是在高取量领域。 原因来自各种观点,例如帮助和预防,尽管重要,但文献中基本上忽视了这一点。 本文介绍了一个新的细微的因果关系推理数据集,并介绍了NLP的一系列新颖的预测任务,如因果关系检测、事件因果关系提取和因果质A。 我们的数据集包含25K因果配对和24K问答配对在多存量样本中的人文说明,每个样本中都有多重因果关系。 通过广泛的实验和分析,我们显示我们数据集的复杂关系对所有三项任务中的最新方法提出了独特的挑战,并凸显了潜在的研究机会,特别是在开发“因果思考”方法方面。