学习在线序列广告的推断用户隐藏国 (Learning to Infer User Hidden States for Online Sequential Advertising) - 专知论文

会员服务 ·

1

隐状态 · 推断 · Performer · 可理解性 · 部分可观测马尔可夫决策过程 ·

2020 年 9 月 3 日

Learning to Infer User Hidden States for Online Sequential Advertising

翻译：学习在线序列广告的推断用户隐藏国

Zhaoqing Peng,Junqi Jin,Lan Luo,Yaodong Yang,Rui Luo,Jun Wang,Weinan Zhang,Haiyang Xu,Miao Xu,Chuan Yu,Tiejian Luo,Han Li,Jian Xu,Kun Gai

from arxiv, to be published in CIKM 2020

To drive purchase in online advertising, it is of the advertiser's great interest to optimize the sequential advertising strategy whose performance and interpretability are both important. The lack of interpretability in existing deep reinforcement learning methods makes it not easy to understand, diagnose and further optimize the strategy. In this paper, we propose our Deep Intents Sequential Advertising (DISA) method to address these issues. The key part of interpretability is to understand a consumer's purchase intent which is, however, unobservable (called hidden states). In this paper, we model this intention as a latent variable and formulate the problem as a Partially Observable Markov Decision Process (POMDP) where the underlying intents are inferred based on the observable behaviors. Large-scale industrial offline and online experiments demonstrate our method's superior performance over several baselines. The inferred hidden states are analyzed, and the results prove the rationality of our inference.

翻译：为了推动在线广告的购买,广告商非常希望优化连续的广告战略,其性能和解释都很重要。现有深层强化学习方法缺乏解释性,因此难以理解、诊断和进一步优化战略。在本文中,我们提出了我们深层元素序列广告(DISA)解决这些问题的方法。解释性的关键部分是理解消费者购买意图,然而,这种意图是不可观察到的(所谓的隐蔽状态 ) 。在本文中,我们将这一意图作为潜在变量,并将问题表述为部分可观测的Markov 决策程序(POMDP ), 其基本意图根据可观察的行为推断。大规模工业离线和在线实验显示了我们的方法在几个基线上的优异性。对隐蔽状态进行了分析,结果证明了我们判断的合理性。

9

相关内容

隐状态

【MIT】反偏差对比学习，Debiased Contrastive Learning

【MIT】反偏差对比学习，Debiased Contrastive Learning

专知会员服务

91+阅读 · 2020年7月4日

【CVPR2020-国科大】状态标签对抗主动学习，Adversarial Active Learning

【CVPR2020-国科大】状态标签对抗主动学习，Adversarial Active Learning

专知会员服务

48+阅读 · 2020年4月13日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

专知会员服务

84+阅读 · 2020年2月18日

在线变分推断，76页ppt，A Regret Bound for Online Variational Inference

在线变分推断，76页ppt，A Regret Bound for Online Variational Inference

专知会员服务

21+阅读 · 2019年12月2日

【变分推断课件】Lectures on Variational Inference： Approximate Bayesian Inference in Machine Learning（附带pdf）

【变分推断课件】Lectures on Variational Inference： Approximate Bayesian Inference in Machine Learning（附带pdf）

专知会员服务

35+阅读 · 2019年11月30日

【麻省理工学院课程】MIT 6.S191：Introduction to Deep Learning , 深度学习导论,NSF研究员Alexander Amini

【麻省理工学院课程】MIT 6.S191：Introduction to Deep Learning , 深度学习导论,NSF研究员Alexander Amini

专知会员服务

34+阅读 · 2019年11月2日

动手学深度学习Dive into Deep Learning中英文版本（附全套代码）

动手学深度学习Dive into Deep Learning中英文版本（附全套代码）

专知会员服务

111+阅读 · 2019年10月26日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

已删除

将门创投

5+阅读 · 2019年8月19日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Learning Tree-based Deep Model for Recommender Systems

Arxiv

3+阅读 · 2018年12月21日

Learning Personalized End-to-End Goal-Oriented Dialog

Arxiv

4+阅读 · 2018年11月12日

Mean Field Multi-Agent Reinforcement Learning

Arxiv

5+阅读 · 2018年6月12日

Relational Deep Reinforcement Learning

Arxiv

5+阅读 · 2018年6月5日

MARS: Memory Attention-Aware Recommender System

Arxiv

6+阅读 · 2018年5月18日

Generative Model for Heterogeneous Inference

Arxiv

4+阅读 · 2018年4月26日

Generative Adversarial Autoencoder Networks

Arxiv

11+阅读 · 2018年3月23日

Deep Reinforcement Learning for List-wise Recommendations

Arxiv

13+阅读 · 2018年1月5日

Denoising Adversarial Autoencoders

Arxiv

9+阅读 · 2018年1月4日

Deep CTR Prediction in Display Advertising

Arxiv

4+阅读 · 2016年9月20日

VIP会员

文章信息

相关主题

部分可观测马尔可夫决策过程

相关VIP内容

【MIT】反偏差对比学习，Debiased Contrastive Learning

【MIT】反偏差对比学习，Debiased Contrastive Learning

专知会员服务

91+阅读 · 2020年7月4日

【CVPR2020-国科大】状态标签对抗主动学习，Adversarial Active Learning

【CVPR2020-国科大】状态标签对抗主动学习，Adversarial Active Learning

专知会员服务

48+阅读 · 2020年4月13日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

专知会员服务

84+阅读 · 2020年2月18日

在线变分推断，76页ppt，A Regret Bound for Online Variational Inference

在线变分推断，76页ppt，A Regret Bound for Online Variational Inference

专知会员服务

21+阅读 · 2019年12月2日

【变分推断课件】Lectures on Variational Inference： Approximate Bayesian Inference in Machine Learning（附带pdf）

【变分推断课件】Lectures on Variational Inference： Approximate Bayesian Inference in Machine Learning（附带pdf）

专知会员服务

35+阅读 · 2019年11月30日

【麻省理工学院课程】MIT 6.S191：Introduction to Deep Learning , 深度学习导论,NSF研究员Alexander Amini

【麻省理工学院课程】MIT 6.S191：Introduction to Deep Learning , 深度学习导论,NSF研究员Alexander Amini

专知会员服务

34+阅读 · 2019年11月2日

动手学深度学习Dive into Deep Learning中英文版本（附全套代码）

动手学深度学习Dive into Deep Learning中英文版本（附全套代码）

专知会员服务

111+阅读 · 2019年10月26日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

基于大型语言模型的网络威胁情报：利用LLM提取MITRE ATT&CK技术 | 最新文献

无人机（UAV）战略：区域大国与暴力非国家行为体在中东冲突中对无人机的运用 | 130页

神经技术与未来无人机战争的交汇点 | 最新报告

美国从“蛛网行动”中汲取轰炸机舰队保护教训

相关资讯

已删除

将门创投

5+阅读 · 2019年8月19日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

相关论文

Learning Tree-based Deep Model for Recommender Systems

Arxiv

3+阅读 · 2018年12月21日

Learning Personalized End-to-End Goal-Oriented Dialog

Arxiv

4+阅读 · 2018年11月12日

Mean Field Multi-Agent Reinforcement Learning

Arxiv

5+阅读 · 2018年6月12日

Relational Deep Reinforcement Learning

Arxiv

5+阅读 · 2018年6月5日

MARS: Memory Attention-Aware Recommender System

Arxiv

6+阅读 · 2018年5月18日

Generative Model for Heterogeneous Inference

Arxiv

4+阅读 · 2018年4月26日

Generative Adversarial Autoencoder Networks

Arxiv

11+阅读 · 2018年3月23日

Deep Reinforcement Learning for List-wise Recommendations

Arxiv

13+阅读 · 2018年1月5日

Denoising Adversarial Autoencoders

Arxiv

9+阅读 · 2018年1月4日

Deep CTR Prediction in Display Advertising

Arxiv

4+阅读 · 2016年9月20日

微信扫码咨询专知VIP会员