In the Click-Through Rate (CTR) prediction scenario, user's sequential behaviors are well utilized to capture the user interest in the recent literature. However, despite being extensively studied, these sequential methods still suffer from three limitations. First, existing methods mostly utilize attention on the behavior of users, which is not always suitable for CTR prediction, because users often click on new products that are irrelevant to any historical behaviors. Second, in the real scenario, there exist numerous users that have operations a long time ago, but turn relatively inactive in recent times. Thus, it is hard to precisely capture user's current preferences through early behaviors. Third, multiple representations of user's historical behaviors in different feature subspaces are largely ignored. To remedy these issues, we propose a Multi-Interactive Attention Network (MIAN) to comprehensively extract the latent relationship among all kinds of fine-grained features (e.g., gender, age and occupation in user-profile). Specifically, MIAN contains a Multi-Interactive Layer (MIL) that integrates three local interaction modules to capture multiple representations of user preference through sequential behaviors and simultaneously utilize the fine-grained user-specific as well as context information. In addition, we design a Global Interaction Module (GIM) to learn the high-order interactions and balance the different impacts of multiple features. Finally, Offline experiment results from three datasets, together with an Online A/B test in a large-scale recommendation system, demonstrate the effectiveness of our proposed approach.
翻译:在点击浏览率(CTR)预测情景中,用户的相继行为被很好地用于捕捉用户对最近文献的兴趣。然而,尽管经过广泛研究,这些相继方法仍然受到三个限制。首先,现有方法大多对用户行为的关注,而这种关注并不总是适合CTR预测,因为用户经常点击与任何历史行为无关的新产品。第二,在实际情况下,有许多用户在很久以前就已运作,但最近时相对不活跃。因此,很难通过早期行为准确地捕捉用户当前偏好。第三,对用户在不同特征子空间的历史行为的多次表述在很大程度上被忽略。为了解决这些问题,我们建议建立一个多互动关注网络(MIAN),以全面提取与任何细微特征(如性别、年龄和用户形象)无关的新产品之间的潜在关系。具体地说,MIAN包含一个多线互动方法,将三个本地互动模块整合到通过连续行为和同时使用不同特征的用户历史行为表态的多重表达方式。我们用高级模型来学习高比例的系统,最后的用户互动模型,作为我们不同层次的模型,并学习了全球数据的特定。