定点 Markov 链中固定分布量比特大小的优化比特边框 (Optimal bounds for bit-sizes of stationary distributions in finite Markov chains)

An irreducible stochastic matrix with rational entries has a stationary distribution given by a vector of rational numbers. We give an upper bound on the lowest common denominator of the entries of this vector. Bounds of this kind are used to study the complexity of algorithms for solving stochastic mean payoff games. They are usually derived using the Hadamard inequality, but this leads to suboptimal results. We replace the Hadamard inequality with the Markov chain tree formula in order to obtain optimal bounds. We also adapt our approach to obtain bounds on the absorption probabilities of finite Markov chains and on the gains and bias vectors of Markov chains with rewards.

翻译：具有合理条目的不可减少的随机矩阵和合理条目具有由理性数字矢量提供的固定分布。我们给此矢量条目最小公分母上一个上限。使用这种界子来研究解决随机中值报酬游戏的算法复杂性。它们通常使用哈达马德的不平等来产生, 但结果不尽理想。我们用马可夫链条树公式来取代哈达马德的不平等, 以获得最佳界限。我们还调整我们的方法, 以获得限定的马尔科夫链的吸收概率以及马尔科夫链条的收益和偏向矢量的界限。

相关内容

马尔可夫链

关注 289

马尔可夫链，因安德烈·马尔可夫（A.A.Markov，1856－1922）得名，是指数学中具有马尔可夫性质的离散事件随机过程。该过程中，在给定当前知识或信息的情况下，过去（即当前以前的历史状态）对于预测将来（即当前以后的未来状态）是无关的。在马尔可夫链的每一步，系统根据概率分布，可以从一个状态变到另一个状态，也可以保持当前状态。状态的改变叫做转移，与不同的状态改变相关的概率叫做转移概率。随机漫步就是马尔可夫链的例子。随机漫步中每一步的状态是在图形中的点，每一步可以移动到任何一个相邻的点，在这里移动到每一个点的概率都是相同的（无论之前漫步路径是如何的）。

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日

【ICCV 2019】贝叶斯优化的1-Bit CNNs 《Bayesian Optimized 1-Bit CNNs》

专知会员服务

16+阅读 · 2019年11月17日

【O'Reilly AI Conference 2019】目标消费者在多个领域的受众预测：NER和贝叶斯方法（Audience projection of target consumers over multiple domains: A NER and Bayesian approach），Helixa的首席科学家兼AI负责人Gianmario Spacagna

专知会员服务

5+阅读 · 2019年11月5日