Machine-learning based recommender systems(RSs) has become an effective means to help people automatically discover their interests. Existing models often represent the rich information for recommendation, such as items, users, and contexts, as embedding vectors and leverage them to predict users' feedback. In the view of causal analysis, the associations between these embedding vectors and users' feedback are a mixture of the causal part that describes why an item is preferred by a user, and the non-causal part that merely reflects the statistical dependencies between users and items, for example, the exposure mechanism, public opinions, display position, etc. However, existing RSs mostly ignored the striking differences between the causal parts and non-causal parts when using these embedding vectors. In this paper, we propose a model-agnostic framework named IV4Rec that can effectively decompose the embedding vectors into these two parts, hence enhancing recommendation results. Specifically, we jointly consider users' behaviors in search scenarios and recommendation scenarios. Adopting the concepts in causal analysis, we embed users' search behaviors as instrumental variables (IVs), to help decompose original embedding vectors in recommendation, i.e., treatments. IV4Rec then combines the two parts through deep neural networks and uses the combined results for recommendation. IV4Rec is model-agnostic and can be applied to a number of existing RSs such as DIN and NRHUB. Experimental results on both public and proprietary industrial datasets demonstrate that IV4Rec consistently enhances RSs and outperforms a framework that jointly considers search and recommendation.
翻译:以机器学习为基础的推荐系统(RSs)已成为帮助人们自动发现自身兴趣的有效手段。 现有的模型通常代表建议所用的丰富信息, 如项目、 用户和背景等, 作为嵌入矢量, 并利用它们来预测用户的反馈。 从因果分析的角度看, 这些嵌入矢量和用户反馈之间的关联是因果部分的混合, 它说明了为什么用户偏爱某个项目, 而非因果部分仅仅反映了用户和项目(例如接触机制、 公众意见、 显示位置等)之间的统计依赖性。 但是, 现有的RSs在使用这些嵌入矢量时, 大多忽略了因果部分和非因果部分之间的惊人差异。 在本文中, 我们提议了一个名为IV4的模型框架, 能够有效地将嵌入矢量嵌入这两个部分, 从而增强建议的结果。 具体地说, 我们共同考虑用户在搜索假设和建议假设中的行为。 采用因果分析中, 我们将用户的搜索行为作为工具变量(IVs), 帮助将因果部分和非因果部分混合存储 IV 。