Most existing recommender systems focus primarily on matching users to content which maximizes user satisfaction on the platform. It is increasingly obvious, however, that content providers have a critical influence on user satisfaction through content creation, largely determining the content pool available for recommendation. A natural question thus arises: can we design recommenders taking into account the long-term utility of both users and content providers? By doing so, we hope to sustain more providers and a more diverse content pool for long-term user satisfaction. Understanding the full impact of recommendations on both user and provider groups is challenging. This paper aims to serve as a research investigation of one approach toward building a provider-aware recommender, and evaluating its impact in a simulated setup. To characterize the user-recommender-provider interdependence, we complement user modeling by formalizing provider dynamics as well. The resulting joint dynamical system gives rise to a weakly-coupled partially observable Markov decision process driven by recommender actions and user feedback to providers. We then build a REINFORCE recommender agent, coined EcoAgent, to optimize a joint objective of user utility and the counterfactual utility lift of the provider associated with the recommended content, which we show to be equivalent to maximizing overall user utility and the utilities of all providers on the platform under some mild assumptions. To evaluate our approach, we introduce a simulation environment capturing the key interactions among users, providers, and the recommender. We offer a number of simulated experiments that shed light on both the benefits and the limitations of our approach. These results help understand how and when a provider-aware recommender agent is of benefit in building multi-stakeholder recommender systems.
翻译:大多数现有建议系统主要侧重于使用户与能够最大限度地提高平台用户满意度的内容相匹配。然而,越来越明显的是,内容提供者通过内容创建对用户满意度产生重要影响,主要是确定可供推荐的内容库。因此自然产生的一个问题是:我们能否设计建议者,同时考虑到用户和内容提供者的长期效用?通过这样做,我们希望维持更多的提供者和更加多样化的内容库,供用户长期满意;理解建议对用户和提供者群体的全面影响具有挑战性。本文的目的是研究建立一个供应商认识的简易建议,并在模拟设置中评价其影响。要描述用户-建议提供者-提供者的相互依存性,我们通过正式确定供应商的动态来补充用户模式;由此产生的联合动态系统将产生一个弱相联的、部分可观察的Markov决策程序,由建议者行动和用户对供应商的反馈来驱动。然后我们建立一个REINFORWA建议机构,即启动的提供商,以优化用户的建设目标,在模拟设置过程中评价其影响效果;我们为用户的模拟应用性和反现实效用提升了整个供应商的效用,在推荐的平台上,我们向用户提供一种最起码的动力的系统提供一种效益。