Target-oriented opinion words extraction (TOWE) is a subtask of aspect-based sentiment analysis (ABSA). Given a sentence and an aspect term occurring in the sentence, TOWE extracts the corresponding opinion words for the aspect term. TOWE has two types of instance. In the first type, aspect terms are associated with at least one opinion word, while in the second type, aspect terms do not have corresponding opinion words. However, previous researches trained and evaluated their models with only the first type of instance, resulting in a sample selection bias problem. Specifically, TOWE models were trained with only the first type of instance, while these models would be utilized to make inference on the entire space with both the first type of instance and the second type of instance. Thus, the generalization performance will be hurt. Moreover, the performance of these models on the first type of instance cannot reflect their performance on entire space. To validate the sample selection bias problem, four popular TOWE datasets containing only aspect terms associated with at least one opinion word are extended and additionally include aspect terms without corresponding opinion words. Experimental results on these datasets show that training TOWE models on entire space will significantly improve model performance and evaluating TOWE models only on the first type of instance will overestimate model performance.
翻译:面向目标的见解文字提取(TOWE)是该句中出现的一个句子和一个方面术语的子任务。根据该句中出现的一个句子和一个方面术语,TOWE提取了该句中的相应意见字词。TOWE有两种实例。在第一类中,方面术语至少与一个意见字有关,而在第二类中,方面术语没有相应的意见字词。然而,以往经过培训和评价的模型只有第一种例子,导致选择偏差问题抽样。具体地说,对TOWE模型只进行了第一种例子的培训,而这些模型将被用于用第一种实例和第二种实例对整个空间空间进行推论。因此,一般性能将受到损害。此外,这些模式在第一类中的表现不能反映其在整个空间的性能。为验证抽样选择偏差问题,四套流行的TOWE数据集仅包含与至少一种意见字眼有关的方面术语,而且还包括一些没有相应意见字眼的方面。这些数据模型的实验结果显示,仅培训TOWE模型将大大改进整个空间状况模型。