Recommender systems rely heavily on increasing computation resources to improve their business goal. By deploying computation-intensive models and algorithms, these systems are able to inference user interests and exhibit certain ads or commodities from the candidate set to maximize their business goals. However, such systems are facing two challenges in achieving their goals. On the one hand, facing massive online requests, computation-intensive models and algorithms are pushing their computation resources to the limit. On the other hand, the response time of these systems is strictly limited to a short period, e.g. 300 milliseconds in our real system, which is also being exhausted by the increasingly complex models and algorithms. In this paper, we propose the computation resource allocation solution (CRAS) that maximizes the business goal with limited computation resources and response time. We comprehensively illustrate the problem and formulate such a problem as an optimization problem with multiple constraints, which could be broken down into independent sub-problems. To solve the sub-problems, we propose the revenue function to facilitate the theoretical analysis, and obtain the optimal computation resource allocation strategy. To address the applicability issues, we devise the feedback control system to help our strategy constantly adapt to the changing online environment. The effectiveness of our method is verified by extensive experiments based on the real dataset from Taobao.com. We also deploy our method in the display advertising system of Alibaba. The online results show that our computation resource allocation solution achieves significant business goal improvement without any increment of computation cost, which demonstrates the efficacy of our method in real industrial practice.
翻译:推荐人系统严重依赖增加计算资源来改进其业务目标。通过部署计算密集型模型和算法,这些系统能够推断用户的兴趣,并从候选人中展示某些广告或商品,以最大限度地实现其业务目标。然而,这些系统在实现其目标方面面临着两个挑战。一方面,面临大量在线请求,计算密集型模型和算法将计算资源推向极限。另一方面,这些系统的响应时间严格限于短期,例如,我们实际系统中的300毫秒,而日益复杂的模型和算法也正在耗尽这些系统。在本文件中,我们提出计算资源分配的计算方法(CRAS),以有限的计算资源和反应时间来最大限度地实现业务目标。我们全面说明问题,并将问题发展成一个有多种限制的优化问题。另一方面,为了解决这些子问题,我们建议收入功能,以便利理论分析,并获得最佳的计算资源分配战略。为了解决适用性问题,我们设计了计算效率的计算方法,我们在计算成本和算法方面,我们通过不使用在线计算方法,不断调整我们实际的计算方法。我们用在线计算方法来验证我们实际的计算方法的计算结果。我们用在线计算方法,我们用在线计算方法来验证我们的计算结果,我们不断改变我们的计算方法,我们用在线计算方法显示我们的计算方法的计算结果。我们根据我们的计算方法的计算结果,我们用在线计算结果,我们用在线计算方法来显示我们的计算方法的计算方法的计算方法显示我们的计算结果。