This paper studies a dynamic discrete-time queuing model where at every period players get a new job and must send all their jobs to a queue that has a limited capacity. Players have an incentive to send their jobs as late as possible; however if a job does not exit the queue by a fixed deadline, the owner of the job incurs a penalty and this job is sent back to the player and joins the queue at the next period. Therefore, stability, i.e. the boundedness of the number of jobs in the system, is not guaranteed. We show that if players are myopically strategic, then the system is stable when the penalty is high enough. Moreover, if players use a learning algorithm derived from a typical no-regret algorithm (exponential weight), then the system is stable when penalties are greater than a bound that depends on the total number of jobs in the system.
翻译:本文研究一种动态的离散时间排队模式, 每段时期, 球员都会得到一份新的工作, 并且必须把他们的所有工作发送到一个容量有限的队列中。 球员有尽可能晚地发送工作的动力; 但是, 如果一个工作没有在固定的最后期限前退出队列, 工作所有人将受到处罚, 这份工作被送回给球员, 并在下一个时期加入队列。 因此, 稳定性, 即系统中工作数量的界限, 得不到保障 。 我们显示, 如果球员是近似战略性的, 那么当惩罚足够高时, 系统就会稳定下来 。 此外, 如果球员使用一种学习算法, 从典型的无限制算法( 超重) 中得出, 当惩罚大于取决于系统中工作总数的约束值时, 系统就会稳定 。