网上广告中多平台预算优化多平台预算优化的斯托卡猛匪 (Stochastic Bandits for Multi-platform Budget Optimization in Online Advertising)

We study the problem of an online advertising system that wants to optimally spend an advertiser's given budget for a campaign across multiple platforms, without knowing the value for showing an ad to the users on those platforms. We model this challenging practical application as a Stochastic Bandits with Knapsacks problem over $T$ rounds of bidding with the set of arms given by the set of distinct bidding $m$-tuples, where $m$ is the number of platforms. We modify the algorithm proposed in Badanidiyuru \emph{et al.,} to extend it to the case of multiple platforms to obtain an algorithm for both the discrete and continuous bid-spaces. Namely, for discrete bid spaces we give an algorithm with regret $O\left(OPT \sqrt {\frac{mn}{B} }+ \sqrt{mn OPT}\right)$, where $OPT$ is the performance of the optimal algorithm that knows the distributions. For continuous bid spaces the regret of our algorithm is $\tilde{O}\left(m^{1/3} \cdot \min\left\{ B^{2/3}, (m T)^{2/3} \right\} \right)$. When restricted to this special-case, this bound improves over Sankararaman and Slivkins in the regime $OPT \ll T$, as is the case in the particular application at hand. Second, we show an $ \Omega\left (\sqrt {m OPT} \right)$ lower bound for the discrete case and an $\Omega\left( m^{1/3} B^{2/3}\right)$ lower bound for the continuous setting, almost matching the upper bounds. Finally, we use a real-world data set from a large internet online advertising company with multiple ad platforms and show that our algorithms outperform common benchmarks and satisfy the required properties warranted in the real-world application.

翻译：我们研究一个在线广告系统的问题,这个系统希望最佳地花广告商的指定预算用于多个平台的竞选活动,而不知道在这些平台上向用户展示广告的价值。我们将这个具有挑战性的实用应用程序建为Stochatical bankities, 其Knapsacks 问题在于$T$T的投标回合, 由一组不同的投标提供一套武器, 美元tuples $m美元, 平台数量为$m。我们修改Badanidiyuru \ emph{et al.} 中提议的算法, 将其扩大到多个平台, 以获得离散和连续投标空间的算法。也就是说, 对于离散投标空间, 我们给出一个具有遗憾的算盘( OPT\\ right\ right) 问题, 美元=\\\\\ right\\\\\\\\\ right\ tricklate 的算法, 美元手数=tral 美元=tal lax a lax lax a lax lax lax lax lax lax lax lax lax lax lax a lax the lax lax lax lax lax lax lax lax lax lax lax lax lax lax lax lax lax lax) lax lax a, lax lax lax lax lad laut lax lax lax laut lax lax lax lax laxxxxxx lax laut laxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

相关内容

Continuity

关注 4

让 iOS 8 和 OS X Yosemite 无缝切换的一个新特性。 > Apple products have always been designed to work together beautifully. But now they may really surprise you. With iOS 8 and OS X Yosemite, you’ll be able to do more wonderful things than ever before.

Source: Apple - iOS 8

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【SIGIR2020】多检索系统的贝叶斯推理风险评估，Bayesian Inferential Risk Evaluation On Multiple IR Systems

专知会员服务

9+阅读 · 2020年6月10日

【NeurIPS 2019|经典论文奖】正则随机学习和在线优化的双重平均法（Dual Averaging Method for Regularized Stochastic Learning and Online Optimization），微软研究院Lin Xiao

专知会员服务

17+阅读 · 2019年12月9日