Utilizing Textual Reviews in Latent Factor Models for Recommender Systems

Most existing recommender systems are based only on rating data and ignore other sources of information that might improve the quality of recommendations, such as textual reviews or user and item characteristics. Moreover, the majority of those systems are applicable only to small datasets (with thousands of observations) and are unable to handle large datasets (with millions of observations). We propose a recommender algorithm that combines a rating modelling technique (the Latent Factor Model) with a topic modelling method based on textual reviews (Latent Dirichlet Allocation), and we extend the algorithm so that extra user- and item-specific information can be added to the system. We evaluate the performance of the algorithm on Amazon.com datasets of different sizes, corresponding to 23 product categories. After comparing the resulting model to four other models, we found that combining textual reviews with ratings leads to better recommendations. We also found that adding extra user and item features to the model increases its prediction accuracy, an effect that is most pronounced on medium and large datasets.
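The abstract does not give the model equations, so the following is only a minimal sketch of the general idea under stated assumptions. It assumes a standard biased latent factor predictor, an optional linear term for extra user or item features, and a softmax link that turns an item's latent factors into topic proportions that a review topic model could share. The function names, the `kappa` parameter, and the softmax link itself are illustrative assumptions and may differ from the authors' actual formulation.

```python
import math


def predict_rating(mu, b_user, b_item, p_user, q_item,
                   features=None, weights=None):
    """Biased latent factor prediction: mu + b_u + b_i + p_u . q_i.

    `features` and `weights` are hypothetical: they add a linear term for
    extra user/item side information, as one possible way to extend the model.
    """
    score = mu + b_user + b_item + sum(p * q for p, q in zip(p_user, q_item))
    if features is not None and weights is not None:
        score += sum(w * x for w, x in zip(weights, features))
    return score


def topic_proportions(q_item, kappa=1.0):
    """Map item factors to topic proportions with a softmax (an assumption).

    This ties each latent rating dimension to one review topic, so that
    ratings and review text can inform the same item parameters.
    """
    scaled = [kappa * q for q in q_item]
    shift = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - shift) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


if __name__ == "__main__":
    p_u = [0.5, 1.0]   # hypothetical user factors
    q_i = [1.0, 0.5]   # hypothetical item factors
    print(predict_rating(3.5, 0.1, -0.2, p_u, q_i))
    print(topic_proportions(q_i))
```

In this sketch, the number of latent factors equals the number of topics, which is what lets a single item vector serve both the rating model and the topic model.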
