User embeddings (vectorized representations of a user) are essential in recommendation systems. Numerous approaches have been proposed to construct a representation for the user in order to find similar items for retrieval tasks, and they have been proven effective in industrial recommendation systems as well. Recently people have discovered the power of using multiple embeddings to represent a user, with the hope that each embedding represents the user's interest in a certain topic. With multi-interest representation, it's important to model the user's preference over the different topics and how the preference change with time. However, existing approaches either fail to estimate the user's affinity to each interest or unreasonably assume every interest of every user fades with an equal rate with time, thus hurting the recall of candidate retrieval. In this paper, we propose the Multi-Interest Preference (MIP) model, an approach that not only produces multi-interest for users by using the user's sequential engagement more effectively but also automatically learns a set of weights to represent the preference over each embedding so that the candidates can be retrieved from each interest proportionally. Extensive experiments have been done on various industrial-scale datasets to demonstrate the effectiveness of our approach.
翻译:在建议系统中,用户嵌入(用户的压载式表达方式)至关重要。 已经提出了许多方法,为用户构建一个代表,以寻找相似的检索任务项目,这些方法在工业建议系统中也被证明是有效的。 最近,人们发现了使用多个嵌入方式代表用户的力量,希望每个嵌入方式代表用户对某个主题的兴趣。 以多种利益代表方式,它对于模拟用户对不同主题的偏好和选择随时间的变化方式非常重要。 但是,现有的方法要么未能估计用户对每项兴趣的亲近性,要么不合理地假定每个用户的每一项利益都随着时间和时间的相同而消失,从而影响对候选人检索的回溯。 在本文中,我们提出了多内在参照模式,这一模式不仅通过使用用户的顺序接触方式对用户产生多重利益,而且自动地学习一系列重心来代表对每个嵌入的偏好,以便候选人能够从每种兴趣中按比例检索。 在各种工业规模的数据设置上进行了广泛的实验,以展示了各种工业规模数据的有效性。