In academic research, recommender systems are often evaluated on benchmark datasets, without much consideration about the global timeline. Hence, we are unable to answer questions like: Do loyal users enjoy better recommendations than non-loyal users? Loyalty can be defined by the time period a user has been active in a recommender system, or by the number of historical interactions a user has. In this paper, we offer a comprehensive analysis of recommendation results along global timeline. We conduct experiments with five widely used models, i.e., BPR, NeuMF, LightGCN, SASRec and TiSASRec, on four benchmark datasets, i.e., MovieLens-25M, Yelp, Amazon-music, and Amazon-electronic. Our experiment results give an answer "No" to the above question. Users with many historical interactions suffer from relatively poorer recommendations. Users who stay with the system for a shorter time period enjoy better recommendations. Both findings are counter-intuitive. Interestingly, users who have recently interacted with the system, with respect to the time point of the test instance, enjoy better recommendations. The finding on recency applies to all users, regardless of users' loyalty. Our study offers a different perspective to understand recommender accuracy, and our findings could trigger a revisit of recommender model design. The code is available in \url{https://github.com/putatu/recommenderLoyalty.
翻译:在学术研究中,建议者系统往往在基准数据集上进行评估,而没有太多考虑到全球时间表。因此,我们无法回答如下问题:忠诚的用户是否比非用户更享受更好的建议?忠诚可以按用户在推荐系统活跃的时间段或用户的历史互动次数来定义。在本文中,我们根据全球时间表对建议结果进行全面分析。我们用五种广泛使用的模式,即BPR、NeuMF{LightGCN、SASRec和TiSASRec进行实验,这四套基准数据集,即MovieLens-25M、Yelp、Amaz-音乐和亚马孙-电子。我们的实验结果给出了对上述问题的“否”答案。许多具有历史互动的用户会受到相对较差的建议。在较短的时间内留在系统中的用户会得到更好的建议。两种结论都是反直观的。有趣的是,最近与系统互动的用户,在测试实例的时间点上,享受更好的建议。关于对用户的直观和触发性观点是,我们所有用户都了解了我们提出的选择。