争取马尔科夫决定程序在时间上的原因 (Towards Causal Temporal Reasoning for Markov Decision Processes) - 专知论文

会员服务 ·

0

Markov · Processing（编程语言） · 操作 · 路径 · MoDELS ·

2022 年 12 月 16 日

Towards Causal Temporal Reasoning for Markov Decision Processes

翻译：争取马尔科夫决定程序在时间上的原因

Milad Kazemi,Nicola Paoletti

from arxiv, 27 pages and 8 figures

We introduce a new probabilistic temporal logic for the verification of Markov Decision Processes (MDP). Our logic is the first to include operators for causal reasoning, allowing us to express interventional and counterfactual queries. Given a path formula $\phi$, an interventional property is concerned with the satisfaction probability of $\phi$ if we apply a particular change $I$ to the MDP (e.g., switching to a different policy); a counterfactual allows us to compute, given an observed MDP path $\tau$, what the outcome of $\phi$ would have been had we applied $I$ in the past. For its ability to reason about different configurations of the MDP, our approach represents a departure from existing probabilistic temporal logics that can only reason about a fixed system configuration. From a syntactic viewpoint, we introduce a generalized counterfactual operator that subsumes both interventional and counterfactual probabilities as well as the traditional probabilistic operator found in e.g., PCTL. From a semantics viewpoint, our logic is interpreted over a structural causal model (SCM) translation of the MDP, which gives us a representation amenable to counterfactual reasoning. We provide a proof-of-concept evaluation of our logic on a reach-avoid task in a grid-world model.

翻译：我们引入了用于核实Markov决定过程(MDP)的新的概率时间逻辑。我们的逻辑首先包括了因果推理操作者,允许我们表达干预和反事实的询问。根据一条路径公式$\phe$,干预性财产涉及的是如果我们对MDP应用特定的改变美元(例如,转换到不同的政策);反事实允许我们计算出美元的结果,考虑到人们所观察到的MDP路径$\tau美元,如果过去我们应用的是美元,美元的结果本来会是什么。由于我们能够解释MDP的不同配置,我们的方法偏离了现有的概率时间逻辑,而这只能解释固定系统配置的理由。从合成角度看,我们引入了一个普遍的反事实操作者,从干预和反事实模型的不稳定性,以及从传统的概率操作者(例如,PCTL)中发现,如果我们过去应用了美元,那么美元的结果会是什么。为了解释MDP的不同配置,我们的逻辑是偏离了现有的概率时间逻辑逻辑,这只能说明固定系统配置的理由。从合成的角度,我们引入了一个普遍的反事实主义逻辑操作者,我们从一个真实的逻辑推论中为我们提供了一个稳定的逻辑推论的逻辑,从一个稳定的逻辑推论,从一个稳定的逻辑模型模型模型模型到一个正确的推论,我们提供了一个正确的推。

0

相关内容

Markov

【ICDM 2022教程】图挖掘中的公平性:度量、算法和应用

【ICDM 2022教程】图挖掘中的公平性:度量、算法和应用

专知会员服务

28+阅读 · 2022年12月26日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

252+阅读 · 2020年4月19日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

基于多目标优化的约束模式挖掘方法研究

国家自然科学基金

1+阅读 · 2015年12月31日

场论中偏微分方程的涡旋解

国家自然科学基金

0+阅读 · 2014年12月31日

跨原子-连续介质水泥基材料应变率效应形成机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

强不定变分方法在若干非线性问题中的应用

国家自然科学基金

0+阅读 · 2013年12月31日

等离子体中原子光复合过程的相对论R矩阵理论研究

国家自然科学基金

0+阅读 · 2013年12月31日

三维椭圆方程Cauchy问题的正则化方法

国家自然科学基金

0+阅读 · 2013年12月31日

基于Nogo/NgR及其下游Rho/ROCK信号通路探讨电针治疗脊髓损伤的作用机制

国家自然科学基金

0+阅读 · 2012年12月31日

有限时间时滞混沌同步及其FPGA 实现

国家自然科学基金

0+阅读 · 2012年12月31日

量子discord及其在量子计算中的研究

国家自然科学基金

1+阅读 · 2011年12月31日

清脉饮及其拆方对动脉粥样硬化形成过程NF－κBmRNA的调控

国家自然科学基金

0+阅读 · 2011年12月31日

Towards Computationally Efficient Responsibility Attribution in Decentralized Partially Observable MDPs

Arxiv

0+阅读 · 2023年2月24日

Cox reduction and confidence sets of models: a theoretical elucidation

Arxiv

0+阅读 · 2023年2月24日

Intermittently Observable Markov Decision Processes

Arxiv

0+阅读 · 2023年2月23日

Optimal Convergence Rate for Exact Policy Mirror Descent in Discounted Markov Decision Processes

Arxiv

0+阅读 · 2023年2月22日

A Survey on Causal Reinforcement Learning

Arxiv

29+阅读 · 2023年2月10日

The FluidFlower International Benchmark Study: Process, Modeling Results, and Comparison to Experimental Data

Arxiv

0+阅读 · 2023年2月9日

A Survey of Deep Causal Model

Arxiv

45+阅读 · 2022年9月19日

Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond

Arxiv

21+阅读 · 2021年9月2日

A Survey on Causal Inference

Arxiv

112+阅读 · 2020年2月5日

Explainable Reasoning over Knowledge Graphs for Recommendation

Arxiv

11+阅读 · 2018年11月12日

VIP会员

文章信息

相关主题

Processing（编程语言）

相关VIP内容

【ICDM 2022教程】图挖掘中的公平性:度量、算法和应用

【ICDM 2022教程】图挖掘中的公平性:度量、算法和应用

专知会员服务

28+阅读 · 2022年12月26日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

252+阅读 · 2020年4月19日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《科研智能：人工智能赋能工业仿真研究报告（2025年）》

具身智能中的世界模型：全面综述

【NeurIPS2025】迈向开放世界的三维“物体性”学习

【博士论文】用于排序与扩散模型的安全、高效与鲁棒强化学习

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Towards Computationally Efficient Responsibility Attribution in Decentralized Partially Observable MDPs

Arxiv

0+阅读 · 2023年2月24日

Cox reduction and confidence sets of models: a theoretical elucidation

Arxiv

0+阅读 · 2023年2月24日

Intermittently Observable Markov Decision Processes

Arxiv

0+阅读 · 2023年2月23日

Optimal Convergence Rate for Exact Policy Mirror Descent in Discounted Markov Decision Processes

Arxiv

0+阅读 · 2023年2月22日

A Survey on Causal Reinforcement Learning

Arxiv

29+阅读 · 2023年2月10日

The FluidFlower International Benchmark Study: Process, Modeling Results, and Comparison to Experimental Data

Arxiv

0+阅读 · 2023年2月9日

A Survey of Deep Causal Model

Arxiv

45+阅读 · 2022年9月19日

Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond

Arxiv

21+阅读 · 2021年9月2日

A Survey on Causal Inference

Arxiv

112+阅读 · 2020年2月5日

Explainable Reasoning over Knowledge Graphs for Recommendation

Arxiv

11+阅读 · 2018年11月12日

相关基金

基于多目标优化的约束模式挖掘方法研究

国家自然科学基金

1+阅读 · 2015年12月31日

场论中偏微分方程的涡旋解

国家自然科学基金

0+阅读 · 2014年12月31日

跨原子-连续介质水泥基材料应变率效应形成机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

强不定变分方法在若干非线性问题中的应用

国家自然科学基金

0+阅读 · 2013年12月31日

等离子体中原子光复合过程的相对论R矩阵理论研究

国家自然科学基金

0+阅读 · 2013年12月31日

三维椭圆方程Cauchy问题的正则化方法

国家自然科学基金

0+阅读 · 2013年12月31日

基于Nogo/NgR及其下游Rho/ROCK信号通路探讨电针治疗脊髓损伤的作用机制

国家自然科学基金

0+阅读 · 2012年12月31日

有限时间时滞混沌同步及其FPGA 实现

国家自然科学基金

0+阅读 · 2012年12月31日

量子discord及其在量子计算中的研究

国家自然科学基金

1+阅读 · 2011年12月31日

清脉饮及其拆方对动脉粥样硬化形成过程NF－κBmRNA的调控

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员