We study a dynamic pricing and capacity sizing problem in a GI/GI/1 queue, where the service provider's objective is to obtain the optimal service fee $p$ and service capacity $\mu$ so as to maximize cumulative expected profit (the service revenue minus the staffing cost and delay penalty). Due to the complex nature of the queueing dynamics, such a problem has no analytic solution so that previous research often resorts to heavy-traffic analysis in that both the arrival rate and service rate are sent to infinity. In this work we propose an online learning framework designed for solving this problem which does not require the system's scale to increase. Our algorithm organizes the time horizon into successive operational cycles and prescribes an efficient procedure to obtain improved pricing and staffing policies in each cycle using data collected in previous cycles. Data here include the number of customer arrivals, waiting times, and the server's busy times. The ingenuity of this approach lies in its online nature, which allows the service provider do better by interacting with the environment. Effectiveness of our online learning algorithm is substantiated by (i) theoretical results including the algorithm convergence and regret analysis (with a logarithmic regret bound), and (ii) engineering confirmation via simulation experiments of a variety of representative GI/GI/1 queues.
翻译:在GI/GI/1队列中,我们研究动态定价和能力问题,在GI/GI/1队列中,服务供应商的目标是获得最佳服务费$p$和服务能力$ mu$,以最大限度地实现预期的累计利润(服务收入减去人事费和延迟罚款)。由于排队动态的复杂性,这样一个问题没有分析性的解决办法,因此以前的研究往往采用重型交易分析,即抵达率和服务率被送至无限范围。在这项工作中,我们提议了一个在线学习框架,旨在解决这一问题,不需要系统规模扩大。我们的算法将时间范围组织成连续运作周期,并规定了一个高效程序,利用以往周期收集的数据,在每个周期中获得更好的定价和人员配置政策。这里的数据包括到货客户人数、等候时间和服务器繁忙时间。这种方法的巧妙性在于其在线性质,它使服务供应商能够更好地与环境互动。我们的在线学习算法的有效性通过以下几个理论结果得到证实:(二) 包括算法趋同和遗憾分析(通过模拟进行对GI1/1的模拟、对GIA进行对数的模拟)。