The content that a recommender system (RS) shows to users influences them. Therefore, when choosing a recommender to deploy, one is implicitly also choosing to induce specific internal states in users. Even more, systems trained via long-horizon optimization will have direct incentives to manipulate users: in this work, we focus on the incentive to shift user preferences so they are easier to satisfy. We argue that - before deployment - system designers should: estimate the shifts a recommender would induce; evaluate whether such shifts would be undesirable; and perhaps even actively optimize to avoid problematic shifts. These steps involve two challenging ingredients: estimation requires anticipating how hypothetical algorithms would influence user preferences if deployed - we do this by using historical user interaction data to train a predictive user model which implicitly contains their preference dynamics; evaluation and optimization additionally require metrics to assess whether such influences are manipulative or otherwise unwanted - we use the notion of "safe shifts", that define a trust region within which behavior is safe: for instance, the natural way in which users would shift without interference from the system could be deemed "safe". In simulated experiments, we show that our learned preference dynamics model is effective in estimating user preferences and how they would respond to new recommenders. Additionally, we show that recommenders that optimize for staying in the trust region can avoid manipulative behaviors while still generating engagement.
翻译:推荐者系统(RS)向用户展示的内容对用户有影响。因此,在选择推荐者时,人们也暗含地选择引导用户的特定内部状态。更何况,通过长正正正正优化培训的系统将直接激励用户操纵:在这项工作中,我们侧重于改变用户偏好以便更容易满足的激励。我们主张,在部署之前,系统设计者应当:估计推荐者所引发的转变;评估这种转变是否不可取;或许甚至积极优化以避免有问题的转变。这些步骤涉及两个具有挑战性的因素:估计要求预测假设的算法如果被部署将如何影响用户偏好——我们这样做的方法是利用历史用户互动数据来培训一种预知用户模型,该模型隐含其偏好动态;评估和优化额外要求用量度来评估这种影响是否具有操纵性或非必要性。我们使用“安全转变”的概念来界定一个行为安全的信任区域:例如,用户在不受系统干扰的情况下会自然地转移。这些步骤涉及两个具有挑战的因素:如果被部署,则需要预见假设的算出假设性算法会如何影响用户偏好-我们这样做时,我们利用历史用户交互互动数据来训练者在评估用户偏好度模型时会建议如何评估用户偏好度,我们如何在评估用户偏好度时,我们如何在评估用户偏好度。