The key of sequential recommendation lies in the accurate item correlation modeling. Previous models infer such information based on item co-occurrences, which may fail to capture the real causal relations, and impact the recommendation performance and explainability. In this paper, we equip sequential recommendation with a novel causal discovery module to capture causalities among user behaviors. Our general idea is firstly assuming a causal graph underlying item correlations, and then we learn the causal graph jointly with the sequential recommender model by fitting the real user behavior data. More specifically, in order to satisfy the causality requirement, the causal graph is regularized by a differentiable directed acyclic constraint. Considering that the number of items in recommender systems can be very large, we represent different items with a unified set of latent clusters, and the causal graph is defined on the cluster level, which enhances the model scalability and robustness. In addition, we provide theoretical analysis on the identifiability of the learned causal graph. To the best of our knowledge, this paper makes a first step towards combining sequential recommendation with causal discovery. For evaluating the recommendation performance, we implement our framework with different neural sequential architectures, and compare them with many state-of-the-art methods based on real-world datasets. Empirical studies manifest that our model can on average improve the performance by about 7% and 11% on f1 and NDCG, respectively. To evaluate the model explainability, we build a new dataset with human labeled explanations for both quantitative and qualitative analysis.
翻译:序列建议的关键在于精确的项目相关模型。 以前的模型根据项目共发生的情况推断出此类信息, 这可能无法捕捉真实的因果关系, 影响建议性能和解释性。 在本文中, 我们为顺序建议配置了一个新的因果发现模块, 以捕捉用户行为中的因果关系。 我们的一般想法是首先假设一个因果图, 作为项目相关性的基础, 然后我们通过匹配真实的用户行为数据, 与顺序建议模型一起学习因果图。 更具体地说, 为了满足因果关系的要求, 因果关系图被一个不同且可定向的周期性约束规范化。 考虑到建议系统中的项目数量可能非常大, 我们代表不同的项目, 并配有一套统一的潜在组群, 并且将因果图在组层次上定义, 从而增强模型的可缩放度和稳健性。 此外, 我们根据我们的知识, 本文迈出了第一步, 将顺序建议与因果发现相结合。 为了评估建议性, 我们用不同的可调度框架, 分别用一套统一的隐性数据结构, 来比较我们基于真实的顺序结构的模型和模型, 对比我们以11世纪的数据分析。