Sentiment analysis is a costly yet necessary task for enterprises to study the opinions of their customers to improve their products and to determine optimal marketing strategies. Due to the existence of a wide range of domains across different products and services, cross-domain sentiment analysis methods have received significant attention. These methods mitigate the domain gap between different applications by training cross-domain generalizable classifiers which help to relax the need for data annotation for each domain. Most existing methods focus on learning domain-agnostic representations that are invariant with respect to both the source and the target domains. As a result, a classifier that is trained using the source domain annotated data would generalize well in a related target domain. We introduce a new domain adaptation method which induces large margins between different classes in an embedding space. This embedding space is trained to be domain-agnostic by matching the data distributions across the domains. Large intraclass margins in the source domain help to reduce the effect of "domain shift" on the classifier performance in the target domain. Theoretical and empirical analysis are provided to demonstrate that the proposed method is effective.
翻译:感官分析是企业研究客户的意见以改善产品并确定最佳营销战略的一项费用高昂但必要的任务。由于存在不同产品和服务的广泛领域,跨部情绪分析方法受到极大关注。这些方法通过培训跨部通用分类师,缩小了不同应用之间的领域差距,有助于放松对每个领域数据说明的需要。大多数现有方法侧重于学习对源和目标领域无差别的域名表征。因此,利用源域培训的分类员将附带说明的数据在相关目标领域加以概括。我们采用了一种新的域名适应方法,在嵌入空间的不同类别之间带来很大的边际。这种嵌入空间经过培训,通过匹配各个领域的数据分布,成为域名列的域名。源域内的大类内边际有助于减少“域变换”对目标领域分类员业绩的影响。提供了理论和实证分析,以证明拟议的方法是有效的。