In this era of information explosion, a personalized recommendation system is convenient for users to get information they are interested in. To deal with billions of users and items, large-scale online recommendation services usually consist of three stages: candidate generation, coarse-grained ranking, and fine-grained ranking. The success of each stage depends on whether the model accurately captures the interests of users, which are usually hidden in users' behavior data. Previous research shows that users' interests are diverse, and one vector is not sufficient to capture users' different preferences. Therefore, many methods use multiple vectors to encode users' interests. However, there are two unsolved problems: (1) The similarity of different vectors in existing methods is too high, with too much redundant information. Consequently, the interests of users are not fully represented. (2) Existing methods model the long-term and short-term behaviors together, ignoring the differences between them. This paper proposes a Hierarchical Multi-Interest Co-Network (HCN) to capture users' diverse interests in the coarse-grained ranking stage. Specifically, we design a hierarchical multi-interest extraction layer to update users' diverse interest centers iteratively. The multiple embedded vectors obtained in this way contain more information and represent the interests of users better in various aspects. Furthermore, we develop a Co-Interest Network to integrate users' long-term and short-term interests. Experiments on several real-world datasets and one large-scale industrial dataset show that HCN effectively outperforms the state-of-the-art methods. We deploy HCN into a large-scale real world E-commerce system and achieve extra 2.5\% improvements on GMV (Gross Merchandise Value).
翻译:在这个信息爆炸的时代,个性化建议系统方便用户获取他们感兴趣的信息。为了处理数十亿用户和项目,大规模在线建议服务通常分为三个阶段:(1) 现有方法中不同矢量的相似性太高,信息过多。因此,用户的利益没有得到充分体现。 (2) 现有方法建模长期和短期行为,同时不考虑用户行为数据的差异。 本文建议使用一种高压多端网络,以了解用户的不同偏好。因此,许多方法都使用多种矢量来编码用户的利益。然而,有两个尚未解决的问题:(1) 现有方法中不同矢量的相似性太高,信息过多。因此,用户的利益没有得到充分体现。 (2) 现有方法建模长期和短期行为,同时不考虑用户的行为差异。 本文建议使用一种超纯度多端多端网络,从而将用户的不同兴趣纳入高端网络。 具体地说,我们设计一种等级的多级多级、跨级的矢量的用户利益,从而更新用户的多层数据。