The driving force behind the recent success of LSTMs has been their ability to learn complex and non-linear relationships. Consequently, our inability to describe these relationships has led to LSTMs being characterized as black boxes. To this end, we introduce contextual decomposition (CD), an interpretation algorithm for analyzing individual predictions made by standard LSTMs, without any changes to the underlying model. By decomposing the output of an LSTM, CD captures the contributions of combinations of words or variables to the final prediction. On the task of sentiment analysis with the Yelp and SST data sets, we show that CD reliably identifies words and phrases of contrasting sentiment, and how they are combined to yield the LSTM's final prediction. Using the phrase-level labels in SST, we also demonstrate that CD successfully extracts positive and negative negations from an LSTM, something that has not previously been done.
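To illustrate the core idea of attributing a prediction to a phrase via an additive decomposition, here is a minimal sketch in NumPy. It shows only the simplest piece of the picture: once the final hidden state is split into a phrase-attributed part and a remainder, a linear classification head splits the logits additively. It does not implement the paper's full recursion through the LSTM gates, and all names and shapes here are illustrative assumptions.

```python
import numpy as np

# Hypothetical setup: a hidden state of dimension d feeding a
# 2-class linear head, as in binary sentiment analysis.
rng = np.random.default_rng(0)
d, n_classes = 8, 2

# Suppose the final hidden state has been decomposed (by some
# procedure such as CD) into a part attributed to a chosen phrase
# (beta) and the remaining context (gamma): h = beta + gamma.
beta = rng.normal(size=d)
gamma = rng.normal(size=d)
h = beta + gamma

# Standard linear classification head (bias omitted for brevity).
W = rng.normal(size=(n_classes, d))

# Because the head is linear, the logits decompose additively into
# a phrase contribution and a context contribution.
logits = W @ h
phrase_contribution = W @ beta
context_contribution = W @ gamma

assert np.allclose(logits, phrase_contribution + context_contribution)
```

The non-trivial part of CD, not shown here, is producing the split `h = beta + gamma` in the first place, which requires propagating the decomposition through the LSTM's non-linear gating at every time step.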