Online auctions are one of the most fundamental facets of the modern economy and power an industry generating hundreds of billions of dollars a year in revenue. Auction theory has historically focused on the question of designing the best way to sell a single item to potential buyers, with the concurrent objectives of maximizing revenue generated or welfare created. Theoretical results in this area have typically relied on some prior Bayesian knowledge agents were assumed to have on each-other. This assumption is no longer satisfied in new markets such as online advertising: similar items are sold repeatedly, and agents are unaware of each other or might try to manipulate each-other. On the other hand, statistical learning theory now provides tools to supplement those missing pieces of information given enough data, as agents can learn from their environment to improve their strategies. This survey covers recent advances in learning in repeated auctions, starting from the traditional economic study of optimal one-shot auctions with a Bayesian prior. We then focus on the question of learning optimal mechanisms from a dataset of bidders' past values. The sample complexity as well as the computational efficiency of different methods will be studied. We will also investigate online variants where gathering data has a cost to be accounted for, either by seller or buyers ("earning while learning"). Later in the survey, we will further assume that bidders are also adaptive to the mechanism as they interact repeatedly with the same seller. We will show how strategic agents can actually manipulate repeated auctions, to their own advantage. All the questions discussed in this survey are grounded in real-world applications and many of the ideas and algorithms we describe are used every day to power the Internet economy.
翻译:在线拍卖是现代经济和电力行业每年产生数千亿美元收入的最根本方面之一。 拍卖理论历来侧重于设计向潜在买家出售单一物品的最佳方法,同时要最大限度地增加收入或创造福利。 该领域的理论结果通常依赖于以前巴耶斯人的知识代理人对彼此进行的最佳拍卖的传统经济研究。 在网上广告等新市场,这一假设不再令人满意:类似项目反复出售,代理商彼此不知情,或者可能试图相互操纵。 另一方面,统计学习理论现在提供了工具,以补充那些缺失的信息,提供足够数据,因为代理商可以从环境学习如何改进战略。本调查涵盖在多次拍卖中学习的最新进展,从以往巴耶斯人认为最佳的一发拍卖代理商对彼此有利。 我们然后侧重于从一个投标人过去价值的数据集中学习最佳机制的问题。 抽样复杂性以及不同方法的计算效率将受到研究。 另一方面,我们还将通过在线变量来补充这些缺失的算法, 因为代理商们可以从环境中学习到不断的变换价, 并且我们学习这些变价机制。