深点击频深率预测的逆向梯度探索驱动器 (Adversarial Gradient Driven Exploration for Deep Click-Through Rate Prediction)

Exploration-Exploitation (E{\&}E) algorithms are commonly adopted to deal with the feedback-loop issue in large-scale online recommender systems. Most of existing studies believe that high uncertainty can be a good indicator of potential reward, and thus primarily focus on the estimation of model uncertainty. We argue that such an approach overlooks the subsequent effect of exploration on model training. From the perspective of online learning, the adoption of an exploration strategy would also affect the collecting of training data, which further influences model learning. To understand the interaction between exploration and training, we design a Pseudo-Exploration module that simulates the model updating process after a certain item is explored and the corresponding feedback is received. We further show that such a process is equivalent to adding an adversarial perturbation to the model input, and thereby name our proposed approach as an the Adversarial Gradient Driven Exploration (AGE). For production deployment, we propose a dynamic gating unit to pre-determine the utility of an exploration. This enables us to utilize the limited amount of resources for exploration, and avoid wasting pageview resources on ineffective exploration. The effectiveness of AGE was firstly examined through an extensive number of ablation studies on an academic dataset. Meanwhile, AGE has also been deployed to one of the world-leading display advertising platforms, and we observe significant improvements on various top-line evaluation metrics.

翻译：在大型在线推荐人系统中,通常采用勘探-探索算法(E ⁇ E)来处理反馈-浏览问题。大多数现有研究都认为,高度不确定性可以成为潜在报酬的良好指标,因此主要侧重于模型不确定性的估计。我们认为,这种方法忽略了探索对模型培训的随后影响。从在线学习的角度来看,采用勘探战略也会影响培训数据的收集,从而进一步影响模型学习。为了了解勘探和培训之间的相互作用,我们设计了一个模拟模型更新过程的优度-探索模块,在探索某个项目并收到相应的反馈后模拟模型更新过程。我们进一步表明,这种过程相当于给模型输入增加一个对立的渗透,从而将我们拟议的方法命名为对模型培训的快速驱动探索(AGAGE)。关于生产部署,我们提出一个动态的定位单位,以预先确定勘探的效用。这使我们能够利用有限的资源进行勘探,并避免在无效的探索中浪费页面资源。我们进一步表明,这样一个过程相当于给模型输入一个对抗性干扰,从而将我们拟议的方法命名为“快速探索” 。我们第一次通过对所部署的高级数据库进行了广泛的实地评估,我们通过对一个世界进行一项重大的升级研究。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日