Over the past decade, tremendous progress has been made in Recommender Systems (RecSys) for well-known tasks such as next-item and next-basket prediction. On the other hand, the recently proposed next-period recommendation (NPR) task is not covered as much. Current works about NPR are mostly based around distinct problem formulations, methods, and proprietary datasets, making solutions difficult to reproduce. In this article, we aim to fill the gap in RecSys methods evaluation on the NPR task using publicly available datasets and (1) introduce the TTRS, a large-scale financial transactions dataset suitable for RecSys methods evaluation; (2) benchmark popular RecSys approaches on several datasets for the NPR task. When performing our analysis, we found a strong repetitive consumption pattern in several real-world datasets. With this setup, our results suggest that the repetitive nature of data is still hard to generalize for the evaluated RecSys methods, and novel item prediction performance is still questionable.
翻译:过去十年来,在诸如下一个项目和下一个篮子预测等众所周知的任务方面,建议系统(RecSys)取得了巨大进展。另一方面,最近提出的下期建议(NPR)任务没有同样多的内容。目前关于NPR的工作主要围绕不同的问题配制、方法和专有数据集进行,使解决办法难以复制。在本条中,我们的目标是利用公开可得的数据集填补对NPR任务进行RecSys方法评估方面的差距,并(1)引入TTRS,这是适合RecSys方法评估的大规模财务交易数据集;(2)为NPR任务的若干数据集确定流行的RecSys方法基准。我们进行分析时发现,在几个真实世界数据集中存在强烈的重复消费模式。有了这个设置,我们的结果显示,数据重复的性质仍然难以为评估的RecSys方法概括,新的项目预测性仍然值得怀疑。