It is common for video-on-demand and music streaming services to adopt a user interface composed of several recommendation lists, i.e. widgets or swipeable carousels, each generated according to a specific criterion or algorithm (e.g. most recent, top popular, recommended for you, editors' choice, etc.). Selecting the appropriate combination of carousel has significant impact on user satisfaction. A crucial aspect of this user interface is that to measure the relevance a new carousel for the user it is not sufficient to account solely for its individual quality. Instead, it should be considered that other carousels will already be present in the interface. This is not considered by traditional evaluation protocols for recommenders systems, in which each carousel is evaluated in isolation, regardless of (i) which other carousels are displayed to the user and (ii) the relative position of the carousel with respect to other carousels. Hence, we propose a two-dimensional evaluation protocol for a carousel setting that will measure the quality of a recommendation carousel based on how much it improves upon the quality of an already available set of carousels. Our evaluation protocol takes into account also the position bias, i.e. users do not explore the carousels sequentially, but rather concentrate on the top-left corner of the screen. We report experiments on the movie domain and notice that under a carousel setting the definition of which criteria has to be preferred to generate a list of recommended items changes with respect to what is commonly understood.
翻译:视频点播和音乐流服务通常采用由若干建议列表组成的用户界面,即根据特定标准或算法(例如最近、最受欢迎、推荐给您、编辑选择等)生成的部件或可翻转的旋转木马。选择适当的旋转木马组合对用户满意度有重大影响。这个用户界面的一个重要方面是测量新旋转木马对用户的相关性,它不足以仅仅说明其个人质量。相反,应该考虑接口中已经存在其他旋转木马。对推荐者系统的传统评价协议通常不考虑这一点,其中每个旋转木马都是单独评估的,不管(一)其他旋转木马是向用户展示的,(二)旋转木马与其他旋转木马的相对位置。因此,我们建议用双维评价协议来测量旋转木马的特性,以它如何改进了已经具备的角心木马质量为标准。我们的评估协议在纸浆的轨道上要考虑的是,在纸浆的轨道上,我们要将什么是直径的轨道上,我们要在方向的轨道上做出一个选择。