Policy iteration techniques for multiple-server dispatching rely on the computation of value functions. In this context, we consider the continuous-space M/G/1-FCFS queue endowed with an arbitrarily-designed cost function for the waiting times of the incoming jobs. The associated relative value function is a solution of Poisson's equation for Markov chains, which in this work we solve in the Laplace transform domain by considering an ancillary, underlying stochastic process extended to (imaginary) negative backlog states. This construction enables us to issue closed-form relative value functions for polynomial and exponential cost functions and for piecewise compositions of the latter, in turn permitting the derivation of interval bounds for the relative value function in the form of power series or trigonometric sums. We review various cost approximation schemes and assess the convergence of the interval bounds these induce on the relative value function. Namely: Taylor expansions (divergent, except for a narrow class of entire functions with low orders of growth), and uniform approximation schemes (polynomials, trigonometric), which achieve optimal convergence rates over finite intervals. This study addresses all the steps to implementing dispatching policies for systems of parallel servers, from the specification of general cost functions towards the computation of interval bounds for the relative value functions and the exact implementation of the first-policy improvement step.
翻译:多服务器发送的政策传换技巧取决于对价值功能的计算。 在这方面, 我们考虑连续空间 M/ G/1- FFFS 队列, 为即将到来的工作的等待时间设置任意设计的成本功能。 相关的相对价值功能是Markov 链的Poisson方程式的解决方案, 我们在此工作中通过考虑一个辅助性、 基础性随机化进程, 扩展到( 想象性的) 负积压国家, 从而在Laplace 转换域。 这一构建使我们能够为多元和指数性成本函数以及后者的片断组合发布封闭式相对价值函数, 从而允许以权力序列或三角计量数的形式为相对价值函数设定间距线。 我们审查各种成本近似方案, 并评估这些间距的趋同点与相对价值函数的趋同。 即: Taylor 扩展( 差异性, 除了一个增长顺序较低的整个功能的狭小类别之外), 以及统一近似计划( polyomial, trgonology), 等式组合式组合, 从而在有限间隔内得出最佳趋同率的趋同率率率, 。 研究 执行整个服务器的相对精确的计算所有步骤, 的计算方法, 执行整个系统 执行总缩缩缩定的计算。