This paper introduces a simple and effective form of data augmentation for recommender systems. A paraphrase similarity model is applied to widely available textual data, such as reviews and product descriptions, yielding new semantic relations that are added to the user-item graph. This increases the density of the graph without needing further labeled data. The data augmentation is evaluated on a variety of recommendation algorithms, using Euclidean, hyperbolic, and complex spaces, and over three categories of Amazon product reviews with differing characteristics. Results show that the data augmentation technique provides significant improvements to all types of models, with the most pronounced gains for knowledge graph-based recommenders, particularly in cold-start settings, leading to state-of-the-art performance.
翻译:本文为推荐人系统介绍了一种简单而有效的数据增强形式,对广泛可得的文本数据,例如审查和产品说明,采用了一种参数相似性模型,产生新的语义关系,添加到用户项目图中,从而增加了图形的密度,而不需要进一步的标签数据。数据增强是根据各种建议算法进行评估的,使用的是Euclidean、双曲和复杂的空间,以及具有不同特点的三类亚马逊产品审查。结果显示,数据增强技术大大改进了所有类型的模型,对基于图表的知识型推荐人,特别是在寒冷起伏的环境下,带来了最显著的收益,导致最先进的性能。