通过政策迭代对定态标记链条进行最佳运输 (Optimal Transport for Stationary Markov Chains via Policy Iteration)

We study the optimal transport problem for pairs of stationary finite-state Markov chains, with an emphasis on the computation of optimal transition couplings. Transition couplings are a constrained family of transport plans that capture the dynamics of Markov chains. Solutions of the optimal transition coupling (OTC) problem correspond to alignments of the two chains that minimize long-term average cost. We establish a connection between the OTC problem and Markov decision processes, and show that solutions of the OTC problem can be obtained via an adaptation of policy iteration. For settings with large state spaces, we develop a fast approximate algorithm based on an entropy-regularized version of the OTC problem, and provide bounds on its per-iteration complexity. We establish a stability result for both the regularized and unregularized algorithms, from which a statistical consistency result follows as a corollary. We validate our theoretical results empirically through a simulation study, demonstrating that the approximate algorithm exhibits faster overall runtime with low error. Finally, we extend the setting and application of our methods to hidden Markov models, and illustrate the potential use of the proposed algorithms in practice with an application to computer-generated music.

翻译：我们研究了固定状态有限马可夫链条的最佳运输问题,重点是计算最佳过渡结合。过渡结合是一个有限的运输计划组合,捕捉马尔科夫链条的动态。最佳过渡结合(OTC)问题的解决方案与两个链条的匹配相吻合,从而将长期平均成本降至最低。我们在OTC问题与Markov决定程序之间建立了联系,并表明通过调整政策迭代可以找到OTC问题的最佳运输问题的解决办法。对于具有大型国家空间的设置,我们根据OTC问题成像化版本制定快速的近似算法,并提供其视光化复杂性的界限。我们为常规和非常规化算法建立稳定的结果,由此得出一个必然的统计一致性结果。我们通过模拟研究来验证我们的理论结果,表明近似算法的显示整个运行时间比较快,误差较小。最后,我们将我们方法的设置和应用扩大到隐藏的Markov模型,并展示了应用计算机成像音乐的拟议算法在实际中的潜在用途。