In e-commerce, opinion tags refer to a ranked list of tags provided by the e-commerce platform that reflect characteristics of reviews of an item. To assist consumers to quickly grasp a large number of reviews about an item, opinion tags are increasingly being applied by e-commerce platforms. Current mechanisms for generating opinion tags rely on either manual labelling or heuristic methods, which is time-consuming and ineffective. In this paper, we propose the abstractive opinion tagging task, where systems have to automatically generate a ranked list of opinion tags that are based on, but need not occur in, a given set of user-generated reviews. The abstractive opinion tagging task comes with three main challenges: (1) the noisy nature of reviews; (2) the formal nature of opinion tags vs. the colloquial language usage in reviews; and (3) the need to distinguish between different items with very similar aspects. To address these challenges, we propose an abstractive opinion tagging framework, named AOT-Net, to generate a ranked list of opinion tags given a large number of reviews. First, a sentence-level salience estimation component estimates each review's salience score. Next, a review clustering and ranking component ranks reviews in two steps: first, reviews are grouped into clusters and ranked by cluster size; then, reviews within each cluster are ranked by their distance to the cluster center. Finally, given the ranked reviews, a rank-aware opinion tagging component incorporates an alignment feature and alignment loss to generate a ranked list of opinion tags. To facilitate the study of this task, we create and release a large-scale dataset, called eComTag, crawled from real-world e-commerce websites. Extensive experiments conducted on the eComTag dataset verify the effectiveness of the proposed AOT-Net in terms of various evaluation metrics.
翻译:在电子商务中,意见标签是指电子商务平台提供的反映某一项目审查特点的排名标签清单。为了帮助消费者快速掌握对某一项目的大量审查,电子商业平台正在越来越多地应用意见标签。目前生成意见标签的机制依赖于人工标签或粗略方法,这既耗时又无效。在本文中,我们建议抽象的意见标记任务,即系统必须自动生成一个反映某一项目审查特点的排序意见标签清单,这些清单反映的是某一组用户产生的审查特点。为了帮助消费者快速掌握对某一项目的大量审查,电子意见标签正在越来越多地被电子商务平台应用。目前生成意见标签的机制依靠的是人工标签或粗略方法,这种方法既耗时费又无效。在本文中,我们建议一个抽象的意见标记框架,即系统必须自动生成一个基于但无需在一组用户生成的审查中出现的排序标签清单。首先,我们评价的一级电子定位部分估计了每项审查的强度;随后,数据评分在每组中,逐级排序中,逐级审查:最后一组项目评分到组。