Discourse relations are typically modeled as a discrete class that characterizes the relation between segments of text (e.g. causal explanations, expansions). However, such predefined discrete classes limits the universe of potential relationships and their nuanced differences. Analogous to contextual word embeddings, we propose representing discourse relations as points in high dimensional continuous space. However, unlike words, discourse relations often have no surface form (relations are between two segments, often with no word or phrase in that gap) which presents a challenge for existing embedding techniques. We present a novel method for automatically creating discourse relation embeddings (DiscRE), addressing the embedding challenge through a weakly supervised, multitask approach to learn diverse and nuanced relations between discourse segments in social media. Results show DiscRE can: (1) obtain the best performance on Twitter discourse relation classification task (macro F1=0.76) (2) improve the state of the art in social media causality prediction (from F1=.79 to .81), (3) perform beyond modern sentence and contextual word embeddings at traditional discourse relation classification, and (4) capture novel nuanced relations (e.g. relations semantically at the intersection of causal explanations and counterfactuals).
翻译:语言关系通常以分解的分类模式来建模,以描述文本各部分之间的关系(如因果解释、扩展等)。然而,这些预先定义的分解类别限制了潜在关系及其细微差异的广度。与背景字嵌入相类似,我们建议将话语关系作为高维连续空间的点。然而,与文字不同,话语关系往往没有表面形式(两个部分之间的关系,往往没有该差距中的文字或短语),给现有嵌入技术带来挑战。我们提出了一种新颖的方法,用于自动创建话语关系嵌入(DiscRE),通过监管不力的多任务方法应对嵌入的挑战,以学习社会媒体各部分话语层之间多样化和细细微关系。结果显示:(1) 在Twitter话语关系分类任务上取得最佳表现(macro F1=0.76) (2) 改善社会媒体因果关系预测(从F1=.79到.81),(3) 超越现代句和背景词嵌入传统话语关系分类,并(4) 抓住真实的因果关系。</s>