近于最佳的适应政策 (Near-Optimal Adaptive Policies for Serving Stochastically Departing Customers)

We consider a multi-stage stochastic optimization problem originally introduced by Cygan et al. (2013), studying how a single server should prioritize stochastically departing customers. In this setting, our objective is to determine an adaptive service policy that maximizes the expected total reward collected along a discrete planning horizon, in the presence of customers who are independently departing between one stage and the next with known stationary probabilities. In spite of its deceiving structural simplicity, we are unaware of non-trivial results regarding the rigorous design of optimal or truly near-optimal policies at present time. Our main contribution resides in proposing a quasi-polynomial-time approximation scheme for adaptively serving impatient customers. Specifically, letting $n$ be the number of underlying customers, our algorithm identifies in $O( n^{ O_{ \epsilon }( \log^2 n ) } )$ time an adaptive service policy whose expected reward is within factor $1 - \epsilon$ of the optimal adaptive reward. Our method for deriving this approximation scheme synthesizes various stochastic analyses in order to investigate how the adaptive optimum is affected by alteration to several instance parameters, including the reward values, the departure probabilities, and the collection of customers itself.

翻译：我们考虑的是Cygan等人(2013年)最初引入的多阶段随机优化问题(2013年),研究一个服务器应如何优先处理即将离任的客户。在这个背景下,我们的目标是确定适应性服务政策,在客户独立离开一个阶段和下一个阶段,已知的固定概率的情况下,在离任阶段之间,最大限度地提高在离任规划视野内收集的预期总报酬。尽管我们忽视了结构简单性,但我们不知道目前严格设计最佳或真正接近最佳政策的非三进制结果。我们的主要贡献在于为适应性不耐烦的客户提出准球时近似近似计划。具体地说,让美元成为基本客户的数量,我们的算法在$O(n ⁇ ⁇ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ } ( \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \