位于Bound-Parater Markov 裁决程序中的 $$$- 区域属性的可满足性弹道 (Satisfiability Bounds for $ω$-Regular Properties in Bounded-Parameter Markov Decision Processes) - 专知论文

会员服务 ·

0

Markov · Analysis · Processing（编程语言） · 转移概率 · 马尔可夫链 ·

2022 年 7 月 27 日

Satisfiability Bounds for $ω$-Regular Properties in Bounded-Parameter Markov Decision Processes

翻译：位于Bound-Parater Markov 裁决程序中的 $$$- 区域属性的可满足性弹道

Jan Křetínský,Tobias Meggendorfer,Maximilian Weininger

We consider the problem of computing minimum and maximum probabilities of satisfying an $\omega$-regular property in a bounded-parameter Markov decision process (BMDP). BMDP arise from Markov decision processes (MDP) by allowing for uncertainty on the transition probabilities in the form of intervals where the actual probabilities are unknown. $\omega$-regular languages form a large class of properties, expressible as, e.g., Rabin or parity automata, encompassing rich specifications such as linear temporal logic. In a BMDP the probability to satisfy the property depends on the unknown transitions probabilities as well as on the policy. In this paper, we compute the extreme values. This solves the problem specifically suggested by Dutreix and Coogan in CDC 2018, extending their results on interval Markov chains with no adversary. The main idea is to reinterpret their work as analysis of interval MDP and accordingly the BMDP problem as analysis of an $\omega$-regular stochastic game, where a solution is provided. This method extends smoothly further to bounded-parameter stochastic games.

翻译：我们考虑了在封闭的参数Markov(BMDP)决定程序中计算满足美元正值财产的最低和最大概率的问题。BMDP产生于Markov决定过程(MDP),允许在实际概率未知的情况下以间隔形式对过渡概率进行不确定性,从而在实际概率未知的情况下,以间隔形式对过渡概率进行计算。$omega-普通语言形成一大类属性,表现为拉宾或等等同自动式语言,包括大量规格,如线性时间逻辑等。在BMDP中,满足该财产的可能性取决于未知的过渡概率以及政策。我们在本文中计算了极端值。这解决了Dutreix和Cogan在CDC 2018年具体提出的问题,将其结果扩展至没有对手的间隔马尔科夫链。主要想法是将其工作作为间隔MDP的分析重新进行互换,并相应地将BMDP问题作为对美元正值常规的游戏的分析,其中提供了一种解决方案。这种方法向约束性游戏更顺利地延伸至约束式的模拟。

0

相关内容

Markov

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

MIT经典《线性代数》，584页pdf，Introduction to Linear Algebra, Fifth Edition, Gilbert Strang, 2016.

MIT经典《线性代数》，584页pdf，Introduction to Linear Algebra, Fifth Edition, Gilbert Strang, 2016.

专知会员服务

428+阅读 · 2021年1月11日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

专知会员服务

77+阅读 · 2020年2月8日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

供体-超原子受体（D-SA）超原子化物及其非线性光学性质的研究

国家自然科学基金

0+阅读 · 2016年12月31日

拓扑狄拉克半金属材料的自旋注入和自旋输运性质研究

国家自然科学基金

0+阅读 · 2015年12月31日

稀土掺杂铁电纳米线的发光可调研究

国家自然科学基金

0+阅读 · 2013年12月31日

Tip60在oxLDL诱导的血管平滑肌细胞自噬及增殖中的作用机制

国家自然科学基金

0+阅读 · 2013年12月31日

拓扑超导体的基态和元激发性质

国家自然科学基金

0+阅读 · 2012年12月31日

Intraflagellar Transport运输纤毛蛋白的分子机理

国家自然科学基金

0+阅读 · 2012年12月31日

拓扑半金属Sb薄膜的分子束外延生长、能带结构调控和原位同步辐射ARPES研究

国家自然科学基金

0+阅读 · 2012年12月31日

Tudor-SN蛋白调控细胞周期的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

离子掺杂CdSe及ZnSe半导体量子点纳米晶的制备与研究

国家自然科学基金

0+阅读 · 2011年12月31日

金属-半导体复合纳米结构的光学性质研究

国家自然科学基金

0+阅读 · 2008年12月31日

Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes

Arxiv

0+阅读 · 2022年9月18日

Polynomial formulations as a barrier for reduction-based hardness proofs

Arxiv

0+阅读 · 2022年9月18日

Follow-the-Perturbed-Leader for Adversarial Markov Decision Processes with Bandit Feedback

Arxiv

0+阅读 · 2022年9月18日

Exploring the Whole Rashomon Set of Sparse Decision Trees

Exploring the Whole Rashomon Set of Sparse Decision Trees

Arxiv

0+阅读 · 2022年9月16日

Stability and Generalization for Markov Chain Stochastic Gradient Methods

Arxiv

0+阅读 · 2022年9月16日

Coupling of finite element and boundary element methods with regularization for a nonlinear interface problem with nonmonotone set-valued transmission conditions

Arxiv

0+阅读 · 2022年9月16日

Private Stochastic Optimization in the Presence of Outliers: Optimal Rates for (Non-Smooth) Convex Losses and Extension to Non-Convex Losses

Arxiv

0+阅读 · 2022年9月15日

Pricing Optimal Outcomes in Coupled and Non-Convex Markets: Theory and Applications to Electricity Markets

Pricing Optimal Outcomes in Coupled and Non-Convex Markets: Theory and Applications to Electricity Markets

Arxiv

0+阅读 · 2022年9月15日

Robust Anytime Learning of Markov Decision Processes

Arxiv

0+阅读 · 2022年9月15日

Stochastic first-order methods for average-reward Markov decision processes

Arxiv

0+阅读 · 2022年9月15日

VIP会员

文章信息

相关主题

Processing（编程语言）

马尔可夫链

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

MIT经典《线性代数》，584页pdf，Introduction to Linear Algebra, Fifth Edition, Gilbert Strang, 2016.

MIT经典《线性代数》，584页pdf，Introduction to Linear Algebra, Fifth Edition, Gilbert Strang, 2016.

专知会员服务

428+阅读 · 2021年1月11日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

专知会员服务

77+阅读 · 2020年2月8日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

大语言模型基准综述

《自适应训练辅助系统概念导论及其在空战指挥官加速培训中的应用》125页

【剑桥博士论文】多智能体学习中的神经多样性

以色列-伊朗空战：短暂而激烈冲突的启示

相关资讯

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes

Arxiv

0+阅读 · 2022年9月18日

Polynomial formulations as a barrier for reduction-based hardness proofs

Arxiv

0+阅读 · 2022年9月18日

Follow-the-Perturbed-Leader for Adversarial Markov Decision Processes with Bandit Feedback

Arxiv

0+阅读 · 2022年9月18日

Exploring the Whole Rashomon Set of Sparse Decision Trees

Exploring the Whole Rashomon Set of Sparse Decision Trees

Arxiv

0+阅读 · 2022年9月16日

Stability and Generalization for Markov Chain Stochastic Gradient Methods

Arxiv

0+阅读 · 2022年9月16日

Coupling of finite element and boundary element methods with regularization for a nonlinear interface problem with nonmonotone set-valued transmission conditions

Arxiv

0+阅读 · 2022年9月16日

Private Stochastic Optimization in the Presence of Outliers: Optimal Rates for (Non-Smooth) Convex Losses and Extension to Non-Convex Losses

Arxiv

0+阅读 · 2022年9月15日

Pricing Optimal Outcomes in Coupled and Non-Convex Markets: Theory and Applications to Electricity Markets

Pricing Optimal Outcomes in Coupled and Non-Convex Markets: Theory and Applications to Electricity Markets

Arxiv

0+阅读 · 2022年9月15日

Robust Anytime Learning of Markov Decision Processes

Arxiv

0+阅读 · 2022年9月15日

Stochastic first-order methods for average-reward Markov decision processes

Arxiv

0+阅读 · 2022年9月15日

相关基金

供体-超原子受体（D-SA）超原子化物及其非线性光学性质的研究

国家自然科学基金

0+阅读 · 2016年12月31日

拓扑狄拉克半金属材料的自旋注入和自旋输运性质研究

国家自然科学基金

0+阅读 · 2015年12月31日

稀土掺杂铁电纳米线的发光可调研究

国家自然科学基金

0+阅读 · 2013年12月31日

Tip60在oxLDL诱导的血管平滑肌细胞自噬及增殖中的作用机制

国家自然科学基金

0+阅读 · 2013年12月31日

拓扑超导体的基态和元激发性质

国家自然科学基金

0+阅读 · 2012年12月31日

Intraflagellar Transport运输纤毛蛋白的分子机理

国家自然科学基金

0+阅读 · 2012年12月31日

拓扑半金属Sb薄膜的分子束外延生长、能带结构调控和原位同步辐射ARPES研究

国家自然科学基金

0+阅读 · 2012年12月31日

Tudor-SN蛋白调控细胞周期的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

离子掺杂CdSe及ZnSe半导体量子点纳米晶的制备与研究

国家自然科学基金

0+阅读 · 2011年12月31日

金属-半导体复合纳米结构的光学性质研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员