Review-based recommender systems have gained noticeable ground in recent years. In addition to the rating scores, those systems are enriched with textual evaluations of items by the users. Neural language processing models, on the other hand, have already found application in recommender systems, mainly as a means of encoding user preference data, with the actual textual description of items serving only as side information. In this paper, a novel approach to incorporating the aforementioned models into the recommendation process is presented. Initially, a neural language processing model and more specifically the paragraph vector model is used to encode textual user reviews of variable length into feature vectors of fixed length. Subsequently this information is fused along with the rating scores in a probabilistic matrix factorization algorithm, based on maximum a-posteriori estimation. The resulting system, ParVecMF, is compared to a ratings' matrix factorization approach on a reference dataset. The obtained preliminary results on a set of two metrics are encouraging and may stimulate further research in this area.
翻译:近年来,基于审查的推荐人系统取得了显著进展,除了评级评分外,这些系统还随着用户对项目的文字评价而得到丰富,神经语言处理模型在推荐人系统中已经找到应用,主要是将用户偏好数据编码,实际的文字描述仅作为附带信息,本文介绍了将上述模式纳入建议过程的新办法;最初,使用神经语言处理模型,更具体地说,使用段落矢量模型,将文本用户对可变长度的文字审查编码成固定长度的特性矢量;随后,这些信息与概率矩阵因数算法中的评级分数结合,以最高值为根据,结果的系统ParVecMF与参考数据集的评级矩阵因数法比较,获得的一套两套衡量方法的初步结果令人鼓舞,并可能促进这方面的进一步研究。