Causal BERT : 文本所表述的事件之间的因果关系检测语言模型 (Causal BERT : Language models for causality detection between events expressed in text)

Causality understanding between events is a critical natural language processing task that is helpful in many areas, including health care, business risk management and finance. On close examination, one can find a huge amount of textual content both in the form of formal documents or in content arising from social media like Twitter, dedicated to communicating and exploring various types of causality in the real world. Recognizing these "Cause-Effect" relationships between natural language events continues to remain a challenge simply because it is often expressed implicitly. Implicit causality is hard to detect through most of the techniques employed in literature and can also, at times be perceived as ambiguous or vague. Also, although well-known datasets do exist for this problem, the examples in them are limited in the range and complexity of the causal relationships they depict especially when related to implicit relationships. Most of the contemporary methods are either based on lexico-semantic pattern matching or are feature-driven supervised methods. Therefore, as expected these methods are more geared towards handling explicit causal relationships leading to limited coverage for implicit relationships and are hard to generalize. In this paper, we investigate the language model's capabilities for causal association among events expressed in natural language text using sentence context combined with event information, and by leveraging masked event context with in-domain and out-of-domain data distribution. Our proposed methods achieve the state-of-art performance in three different data distributions and can be leveraged for extraction of a causal diagram and/or building a chain of events from unstructured text.

翻译：事件之间的因果关系理解是一个至关重要的自然语言处理任务,在许多领域都有帮助,包括保健、商业风险管理和财政。仔细检查后,人们可以发现大量文字内容,既有正式文件的形式,也有诸如Twitter等社交媒体的内容,专门用于沟通和探索现实世界中各种类型的因果关系。认识到自然语言事件之间的这些“原因-影响”关系,这仍然是一项挑战,因为它往往暗含地表达。隐含的因果关系很难通过文献中所使用的大多数技术来发现,有时也被认为是模糊或模糊的。此外,尽管对这个问题确实存在众所周知的数据集,但其中的例子在它们所描述的因果关系的范围和复杂性方面是有限的,特别是在与隐含关系有关的情况下。当代方法要么基于语言-语义模式的匹配,要么是以特征驱动的监管方法。因此,由于预期这些方法更倾向于处理明确的因果关系,导致对隐含关系的报道有限,而且难以概括。在本文中,我们从不熟悉的语言模型中调查了在自然语言序列中构建因果联系的因果关系和因果图流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流离流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流流。