Cross-domain sentiment classification has been a hot spot these years, which aims to learn a reliable classifier using labeled data from a source domain and evaluate it on a target domain. In this vein, most approaches utilized domain adaptation that maps data from different domains into a common feature space. To further improve the model performance, several methods targeted to mine domain-specific information were proposed. However, most of them only utilized a limited part of domain-specific information. In this study, we first develop a method of extracting domain-specific words based on the topic information derived from topic models. Then, we propose a Topic Driven Adaptive Network (TDAN) for cross-domain sentiment classification. The network consists of two sub-networks: a semantics attention network and a domain-specific word attention network, the structures of which are based on transformers. These sub-networks take different forms of input and their outputs are fused as the feature vector. Experiments validate the effectiveness of our TDAN on sentiment classification across domains. Case studies also indicate that topic models have the potential to add value to cross-domain sentiment classification by discovering interpretable and low-dimensional subspaces.
翻译:这些年来,跨域情绪分类一直是热点,目的是利用来源域的标签数据学习可靠的分类器,并在目标域进行评价。在这方面,大多数方法都利用了将不同领域的数据映射成共同特征空间的域性调整方法。为了进一步改进模型性能,提出了针对地雷领域特定信息的几种方法。然而,大多数方法只使用了特定领域信息中有限的部分。在这项研究中,我们首先根据专题模型产生的专题信息,开发了一种提取特定域词的方法。然后,我们提出了跨域感应分类的“主题驱动适应网络”(TDAN) 。该网络由两个子网络组成:一个语义关注网络和一个特定域的文字关注网络,其结构以变异器为基础。这些子网络采用不同的输入形式,其输出结果与特性矢量相结合。实验证实了我们的TDAN在跨域的感应感分类方面的有效性。案例研究还表明,主题模型通过发现可解释和低维度子空间,有可能增加交叉感应力分类的价值。