Throttling is one of the most popular budget control methods in today's online advertising markets. When a budget-constrained advertiser employs throttling, she can choose whether or not to participate in an auction after the advertising platform recommends a bid. This paper focuses on the dynamic budget throttling process in repeated second-price auctions from a theoretical view. An essential feature of the underlying problem is that the advertiser does not know the distribution of the highest competing bid upon entering the market. To model the difficulty of eliminating such uncertainty, we consider two different information structures. The advertiser could obtain the highest competing bid in each round with full-information feedback. Meanwhile, with partial information feedback, the advertiser could only have access to the highest competing bid in the auctions she participates in. We propose the OGD-CB algorithm, which involves simultaneous distribution learning and revenue optimization. In both settings, we demonstrate that this algorithm guarantees an $O(\sqrt{T\log T})$ regret with probability $1 - O(1/T)$ relative to the fluid adaptive throttling benchmark. By proving a lower bound of $\Omega(\sqrt{T})$ on the minimal regret for even the hindsight optimum, we establish the near optimality of our algorithm. Finally, we compare the fluid optimum of throttling to that of pacing, another widely adopted budget control method. The numerical relationship of these benchmarks sheds new light on the understanding of different online algorithms for revenue maximization under budget constraints.
翻译:在当今网上广告市场中,最受欢迎的预算控制方法之一就是 hlottling 。当受预算限制的广告商在网上广告市场上进行抽动时,她可以选择在广告平台建议出价后是否参加拍卖。本文侧重于从理论角度反复进行的二次价格拍卖中的动态预算抽动过程。根本问题的一个特征是广告商在进入市场时不知道最高竞价的分配情况。要模拟消除这种不确定性的困难,我们考虑两种不同的信息结构。广告商可以在每轮中以完整信息反馈的方式获得最高竞价报价。同时,在部分信息反馈的情况下,广告商只能获得她参加的拍卖中最高竞价报价。我们建议OGD-CB算法,这涉及到同时进行分配学习和收入优化。在这两种情况下,我们证明这一算法保证了美元(sqrt{T} ) 的销售额分配。我们采用的概率为1-O1/T美元,比液调制调试基准要低。通过证明最接近最接近最佳预算的汇率比值,我们最接近最接近最接近于最接近最接近最接近最接近最接近最接近最接近最理想的预算方法。