网上广告中多平台预算优化多平台预算优化的斯托卡猛匪 (Stochastic Bandits for Multi-platform Budget Optimization in Online Advertising)

We study the problem of an online advertising system that wants to optimally spend an advertiser's given budget for a campaign across multiple platforms, without knowing the value for showing an ad to the users on those platforms. We model this challenging practical application as a Stochastic Bandits with Knapsacks problem over $T$ rounds of bidding with the set of arms given by the set of distinct bidding $m$-tuples, where $m$ is the number of platforms. We modify the algorithm proposed in Badanidiyuru \emph{et al.,} to extend it to the case of multiple platforms to obtain an algorithm for both the discrete and continuous bid-spaces. Namely, for discrete bid spaces we give an algorithm with regret $O\left(OPT \sqrt {\frac{mn}{B} }+ \sqrt{mn OPT}\right)$, where $OPT$ is the performance of the optimal algorithm that knows the distributions. For continuous bid spaces the regret of our algorithm is $\tilde{O}\left(m^{1/3} \cdot \min\left\{ B^{2/3}, (m T)^{2/3} \right\} \right)$. When restricted to this special-case, this bound improves over Sankararaman and Slivkins in the regime $OPT \ll T$, as is the case in the particular application at hand. Second, we show an $ \Omega\left (\sqrt {m OPT} \right)$ lower bound for the discrete case and an $\Omega\left( m^{1/3} B^{2/3}\right)$ lower bound for the continuous setting, almost matching the upper bounds. Finally, we use a real-world data set from a large internet online advertising company with multiple ad platforms and show that our algorithms outperform common benchmarks and satisfy the required properties warranted in the real-world application.

翻译：我们研究一个在线广告系统的问题,这个系统希望最佳地花广告商的指定预算用于多个平台的竞选活动,而不知道在这些平台上向用户展示广告的价值。我们将这个具有挑战性的实用应用程序建为Stochatical bankities, 其Knapsacks 问题在于$T$T的投标回合, 由一组不同的投标提供一套武器, 美元tuples $m美元, 平台数量为$m。我们修改Badanidiyuru \ emph{et al.} 中提议的算法, 将其扩大到多个平台, 以获得离散和连续投标空间的算法。也就是说, 对于离散投标空间, 我们给出一个具有遗憾的算盘( OPT\\ right\ right) 问题, 美元=\\\\\ right\\\\\\\\\ right\ tricklate 的算法, 美元手数=tral 美元=tal lax a lax lax a lax lax lax lax lax lax lax lax lax lax lax a lax the lax lax lax lax lax lax lax lax lax lax lax lax lax lax lax lax lax lax) lax lax a, lax lax lax lax lad laut lax lax lax laut lax lax lax lax laxxxxxx lax laut laxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

相关内容

Continuity

关注 4

让 iOS 8 和 OS X Yosemite 无缝切换的一个新特性。 > Apple products have always been designed to work together beautifully. But now they may really surprise you. With iOS 8 and OS X Yosemite, you’ll be able to do more wonderful things than ever before.

Source: Apple - iOS 8

【SIGIR2021】ScaleFreeCTR：超大规模Embedding推荐模型分布式训练系统

专知会员服务

28+阅读 · 2021年4月26日

最新《图神经网络知识图谱补全》综述论文

专知会员服务

158+阅读 · 2020年7月29日

【ICML2020】噪声在随机梯度下降中的泛化效益，On the Generalization Benefit of Noise in Stochastic Gradient Descent

专知会员服务

19+阅读 · 2020年6月29日

【斯坦福】凸优化圣经- Convex Optimization （附730pdf下载）

专知会员服务

231+阅读 · 2020年6月5日