Consider a service system where incoming tasks are instantaneously dispatched to one out of many heterogeneous server pools. Associated with each server pool is a concave utility function which depends on the class of the server pool and its current occupancy. We derive an upper bound for the mean normalized aggregate utility in stationarity and introduce two load balancing policies that achieve this upper bound in a large-scale regime. Furthermore, the transient and stationary behavior of these asymptotically optimal load balancing policies is characterized on the scale of the number of server pools, in the same large-scale regime.
翻译:考虑一种服务系统,其中传入的任务会立即分配给许多异构的服务器池之一。每个服务器池都与一个凹效用函数相关联,该函数取决于服务器池的类别和其当前占用情况。我们推导出一个平稳状态下的平均归一化聚合效用的上限,并引入两种负载均衡策略,在大规模范围内实现该上限。此外,在同一大规模范围内,表征这些渐进最优负载均衡策略的瞬态和稳态行为,以服务器池数量为尺度。