Much of the complexity of Recommender Systems (RSs) comes from the fact that they are used as part of more complex applications and affect user experience through a varied range of user interfaces. However, research focused almost exclusively on the ability of RSs to produce accurate item rankings while giving little attention to the evaluation of RS behavior in real-world scenarios. Such narrow focus has limited the capacity of RSs to have a lasting impact in the real world and makes them vulnerable to undesired behavior, such as reinforcing data biases. We propose EvalRS as a new type of challenge, in order to foster this discussion among practitioners and build in the open new methodologies for testing RSs "in the wild".
翻译:推荐人系统(RSs)的复杂程度很大是因为它们被用作更复杂应用的一部分,并通过各种用户界面影响用户经验,然而,研究几乎完全集中于RSs编制准确的项目排名的能力,而很少注意对现实世界情景中RS行为的评估。这种狭隘的焦点限制了RSs在现实世界中产生持久影响的能力,使他们易受不理想行为的影响,例如强化数据偏差。我们建议EvalRS系统作为一种新的挑战,以促进从业人员之间的这种讨论,并形成“野外”测试RSs的开放新方法。