Explainable recommendation has attracted much attention from the industry and academic communities. It has shown great potential for improving the recommendation persuasiveness, informativeness and user satisfaction. Despite a lot of promising explainable recommender models have been proposed in the past few years, the evaluation strategies of these models suffer from several limitations. For example, the explanation ground truths are not labeled by real users, the explanations are mostly evaluated based on only one aspect and the evaluation strategies can be hard to unify. To alleviate the above problems, we propose to build an explainable recommendation dataset with multi-aspect real user labeled ground truths. In specific, we firstly develop a video recommendation platform, where a series of questions around the recommendation explainability are carefully designed. Then, we recruit about 3000 users with different backgrounds to use the system, and collect their behaviors and feedback to our questions. In this paper, we detail the construction process of our dataset and also provide extensive analysis on its characteristics. In addition, we develop a library, where ten well-known explainable recommender models are implemented in a unified framework. Based on this library, we build several benchmarks for different explainable recommendation tasks. At last, we present many new opportunities brought by our dataset, which are expected to shed some new lights to the explainable recommendation field. Our dataset, library and the related documents have been released at https://reasoner2023.github.io/.
翻译:解释性建议吸引了业界和学术界的极大关注,显示出了改进建议说服力、信息丰富度和用户满意度的巨大潜力。尽管在过去几年里提出了许多大有希望的可解释性建议模式,但这些模式的评价战略存在若干限制。例如,解释性地面真相没有真实用户的标签,解释性解释大多只根据一个方面进行评估,评价战略很难统一。为了缓解上述问题,我们提议建立一个可解释的建议数据集,配有多层用户贴上标签的真正地面真相。具体地说,我们首先开发了一个视频建议平台,其中围绕建议的解释性提出了一系列问题,经过仔细设计。然后,我们征聘了大约3 000个具有不同背景的用户来使用系统,收集他们的行为和对我们的问题的反馈。在这份文件中,我们详细说明了我们数据集的构建过程,并且对其特征进行了广泛的分析。此外,我们开发了一个图书馆,在这个有10个广为人知的可解释性建议模型在一个统一的框架中实施。基于这个图书馆,我们为不同的可解释性建议设计了几个基准。然后,我们为不同的可解释性建议设计了几个基准。我们最新的实地提供了一些与我们目前数据相关的机会。我们的数据设置的实地。</s>