We consider repeated first price auctions where each bidder, having a deterministic type, learns to bid using a mean-based learning algorithm. We completely characterize the Nash convergence property of the bidding dynamics in two senses: (1) time-average: the fraction of rounds where bidders play a Nash equilibrium approaches to 1 in the limit; (2) last-iterate: the mixed strategy profile of bidders approaches to a Nash equilibrium in the limit. Specifically, the results depend on the number of bidders with the highest value: - If the number is at least three, the bidding dynamics almost surely converges to a Nash equilibrium of the auction, both in time-average and in last-iterate. - If the number is two, the bidding dynamics almost surely converges to a Nash equilibrium in time-average but not necessarily in last-iterate. - If the number is one, the bidding dynamics may not converge to a Nash equilibrium in time-average nor in last-iterate. Our discovery opens up new possibilities in the study of convergence dynamics of learning algorithms.
翻译:我们考虑的是重复的第一次价格拍卖,即每个投标人具有确定性,学会使用一种基于平均值的学习算法进行投标。我们完全从两个意义上描述出投标动态的纳什趋同特性:(1) 平均时间:(1) 投标人将纳什均衡方法玩到极限1的回合的分数;(2) 最晚时间:投标人对纳什平衡方法的混合战略剖面在极限中达到极限。具体地说,结果取决于价值最高的投标人的数目:—如果数字至少是三个,投标动态几乎肯定会与拍卖的纳什平衡一致,无论在时间平均还是最后时间上。—如果数字是两个,投标动态几乎肯定会在时间平均和最后时间上达到纳什平衡。—如果数字是一个,投标动态可能不会在时间平均或最后时间上达到纳什平衡。我们发现,在学习算法的趋同动态研究中,新的可能性是存在的。