CTR 预测中用户行为模型的 Kalman 过滤注意 Kalman 过滤器 (Kalman Filtering Attention for User Behavior Modeling in CTR Prediction)

Click-through rate (CTR) prediction is one of the fundamental tasks for e-commerce search engines. As search becomes more personalized, it is necessary to capture the user interest from rich behavior data. Existing user behavior modeling algorithms develop different attention mechanisms to emphasize query-relevant behaviors and suppress irrelevant ones. Despite being extensively studied, these attentions still suffer from two limitations. First, conventional attentions mostly limit the attention field only to a single user's behaviors, which is not suitable in e-commerce where users often hunt for new demands that are irrelevant to any historical behaviors. Second, these attentions are usually biased towards frequent behaviors, which is unreasonable since high frequency does not necessarily indicate great importance. To tackle the two limitations, we propose a novel attention mechanism, termed Kalman Filtering Attention (KFAtt), that considers the weighted pooling in attention as a maximum a posteriori (MAP) estimation. By incorporating a priori, KFAtt resorts to global statistics when few user behaviors are relevant. Moreover, a frequency capping mechanism is incorporated to correct the bias towards frequent behaviors. Offline experiments on both benchmark and a 10 billion scale real production dataset, together with an Online A/B test, show that KFAtt outperforms all compared state-of-the-arts. KFAtt has been deployed in the ranking system of a leading e commerce website, serving the main traffic of hundreds of millions of active users everyday.

翻译：点击率( CTR) 预测是电子商务搜索引擎的基本任务之一。随着搜索变得更加个性化, 有必要从丰富的行为数据中捕捉用户的兴趣。现有的用户行为模型算法开发了不同的关注机制, 以强调与询问有关的行为, 压制无关的行为。尽管正在广泛研究, 这些关注仍然受到两个限制。首先, 常规关注大多将关注领域限制在单一用户的行为上, 而这在电子商务中并不合适, 用户往往寻找与任何历史行为无关的新需求。其次, 这些关注通常偏向频繁的行为, 因为高频率不一定表明其不合理。为了应对这两个限制, 我们提议了一个新的关注机制, 称为 Kalman 过滤( KFATt), 将加权集中关注视为一个最高顺位( MAP) 估计。首先, KFA 用户行为很少相关时, KFAT 使用频率上限机制来纠正经常发生的电子行为, 因为高频并不合理, 我们提议一个新的关注机制, 以上百万个日历的在线系统为基准/ KFA级别, 的直径直径直达10亿次的Seralalalalalal 测试显示所有KFA级实际生产排名。

相关内容

注意力机制

关注 120

Attention机制最早是在视觉图像领域提出来的，但是真正火起来应该算是google mind团队的这篇论文《Recurrent Models of Visual Attention》[14]，他们在RNN模型上使用了attention机制来进行图像分类。随后，Bahdanau等人在论文《Neural Machine Translation by Jointly Learning to Align and Translate》 [1]中，使用类似attention的机制在机器翻译任务上将翻译和对齐同时进行，他们的工作算是是第一个提出attention机制应用到NLP领域中。接着类似的基于attention机制的RNN模型扩展开始应用到各种NLP任务中。最近，如何在CNN中使用attention机制也成为了大家的研究热点。下图表示了attention研究进展的大概趋势。

【NeurIPS 2020京东】基于卡尔曼滤波的注意力机制—广告点击率预估中的用户行为建模

专知会员服务

26+阅读 · 2020年10月30日

【CVPR2020】视觉导航的神经拓扑SLAM，Neural Topological SLAM for Visual Navigation

专知会员服务

52+阅读 · 2020年5月26日

【百度开源2019年新型冠状病毒RNA预测算法】Baidu Open-Sources RNA Prediction Algorithm for 2019 Novel Coronavirus

专知会员服务

26+阅读 · 2020年2月6日

【AAAI2020-清华大学】高效的异构协同过滤推荐（Efficient Heterogeneous Collaborative Filtering without Negative Sampling for Recommendation），张敏，马少平等

专知会员服务

61+阅读 · 2019年11月22日