Model-based methods for recommender systems have been studied extensively in recent years. In systems with large corpus, however, the calculation cost for the learnt model to predict all user-item preferences is tremendous, which makes full corpus retrieval extremely difficult. To overcome the calculation barriers, models such as matrix factorization resort to inner product form (i.e., model user-item preference as the inner product of user, item latent factors) and indexes to facilitate efficient approximate k-nearest neighbor searches. However, it still remains challenging to incorporate more expressive interaction forms between user and item features, e.g., interactions through deep neural networks, because of the calculation cost. In this paper, we focus on the problem of introducing arbitrary advanced models to recommender systems with large corpus. We propose a novel tree-based method which can provide logarithmic complexity w.r.t. corpus size even with more expressive models such as deep neural networks. Our main idea is to predict user interests from coarse to fine by traversing tree nodes in a top-down fashion and making decisions for each user-node pair. We also show that the tree structure can be jointly learnt towards better compatibility with users' interest distribution and hence facilitate both training and prediction. Experimental evaluations with two large-scale real-world datasets show that the proposed method significantly outperforms traditional methods. Online A/B test results in Taobao display advertising platform also demonstrate the effectiveness of the proposed method in production environments.
翻译:近些年来,人们广泛研究了以模型为基础的推荐系统方法。然而,在大型系统中,预测所有用户-项目偏好所学模型的计算成本是巨大的,这使得充分检索非常困难。为了克服计算障碍,矩阵因子化模型等模型采用内部产品形式(例如,模型用户-项目偏好作为用户的内部产品,项目潜在因素)和指数,以促进高效率的近距离近距离搜索。然而,在使用和项目特性之间采用更清晰的交互形式,例如,由于计算成本,通过深层神经网络进行互动,仍然具有挑战性。在本文中,我们侧重于采用任意的先进模型来向大型系统提出建议的问题。我们提出了一种新的基于树基的方法,这种方法可以提供逻辑复杂性(例如,作为用户的内部产品,项目潜在因素)和指数,即使采用更清晰的模型,例如深层的神经网络,也能够预测用户的兴趣,从上下到细小的树节点,到为每个用户-点配方做出决策。我们还表明,在在线平台中,采用任意的先进模型,可以共同学习和实验性方法,从而显示大规模地试验式的模型的用户-测试方法,从而显示,从而显示,试验式的模型的模型的模型的模型可以显示,从而显示,从而显示,结果的模型的模型可以显示,从而显示,从而显示,从而显示,在大规模的模型的模拟的模型的模型的模型的模型的模型的模型的模型的估价方法可以显示,从而显示,从而显示,从而显示,从而显示,从而显示,从而显示,从而显示,显示,从而显示,使用户-试验式的模拟式的模型的模型的模型的模拟式的估价方法可以显示,从而显示,使用户-