Under the Markov decision process (MDP) congestion game framework, we study the problem of enforcing population distribution constraints on a population of players with stochastic dynamics and coupled congestion costs. Existing research demonstrates that the constraints on the players' population distribution can be satisfied by enforcing tolls. However, computing the minimum toll value for constraint satisfaction requires accurate modeling of the player's congestion costs. Motivated by settings where an accurate congestion cost model is unavailable (e.g. transportation networks), we consider an MDP congestion game with unknown congestion costs. We assume that a constraint-enforcing authority can repeatedly enforce tolls on a population of players who converges to an $\epsilon$-optimal population distribution for any given toll. We then construct a myopic update algorithm to compute the minimum toll value while ensuring that the constraints are satisfied on average. We analyze how the players' sub-optimal responses to tolls impact the rates of convergence towards the minimum toll value and constraint satisfaction. Finally, we apply our results to transportation by building a high-fidelity game model using data from the New York City's (NYC) Taxi and Limousine Commission (TLC), and illustrate how to efficiently reduce congestion while minimizing the impact on driver earnings.
翻译:根据Markov决定(MDP)拥堵游戏框架,我们研究对具有随机动态和同时的拥堵成本的玩家群体实施人口分布限制的问题。现有的研究表明,对玩家人口分布的限制可以通过强制收费来满足。然而,计算限制满意度的最低收费值需要准确模拟玩家的拥堵成本。受无法准确的堵塞成本模型(例如交通网络)的环境驱动,我们考虑的是MDP拥挤游戏,其拥挤成本未知。我们假定,限制强化当局可以反复对聚集在美元/欧元-美元-美元-最优人口分布的玩家群体实施收费限制。我们随后将一个超理想的计算算法来计算最低收费值,同时确保平均满足这些制约因素。我们分析玩家对降低最低收费和制约满意度的趋同率的次最佳反应。最后,我们运用我们的成果,利用来自纽约州立航空和交通委员会的数据,构建一个高密度游戏模型,降低纽约市的交通成本,同时减少交通成本。