Aspect-based sentiment analysis (ABSA) aims to predict the sentiment expressed in a review with respect to a given aspect. The core of ABSA is to model the interaction between the context and given aspect to extract the aspect-related information. In prior work, attention mechanisms and dependency graph networks are commonly adopted to capture the relations between the context and given aspect. And the weighted sum of context hidden states is used as the final representation fed to the classifier. However, the information related to the given aspect may be already discarded and adverse information may be retained in the context modeling processes of existing models. This problem cannot be solved by subsequent modules and there are two reasons: first, their operations are conducted on the encoder-generated context hidden states, whose value cannot change after the encoder; second, existing encoders only consider the context while not the given aspect. To address this problem, we argue the given aspect should be considered as a new clue out of context in the context modeling process. As for solutions, we design several aspect-aware context encoders based on different backbones: an aspect-aware LSTM and three aspect-aware BERTs. They are dedicated to generate aspect-aware hidden states which are tailored for ABSA task. In these aspect-aware context encoders, the semantics of the given aspect is used to regulate the information flow. Consequently, the aspect-related information can be retained and aspect-irrelevant information can be excluded in the generated hidden states. We conduct extensive experiments on several benchmark datasets with empirical analysis, demonstrating the efficacies and advantages of our proposed aspect-aware context encoders.
翻译:以外观为基础的情绪分析(ABSA)旨在预测在审查中就某一特定方面表达的情绪。ABSA的核心是模拟上下文和特定方面的相互作用,以提取与方面有关的信息。在先前的工作中,通常采用注意机制和依赖图形网络,以捕捉上下文和特定方面之间的关系。而隐藏状态的加权总和被作为向分类者提供的最后表述材料。然而,与特定方面有关的信息可能被抛弃,现有模型的背景建模过程中可能保留不良信息。这个问题无法通过随后的模块加以解决,原因有二:第一,其操作是在编码器生成的环境隐藏状态上进行,其价值在编码器之后无法改变;第二,现有编码器仅考虑上下文而不是特定方面。为解决这一问题,我们争论将特定方面视为背景建模过程中的一个新的线索。关于解决方案,我们设计了多个直观背景背景分析器,以不同的主干线为基础:一个对LSTM的识别面值进行实验,而三个方面是显示我们当前运行的运行过程中的数据,因此,这些方面是用于STM和BAR任务中的某些方面。