We introduce stochastic decision Petri nets (SDPNs), a form of stochastic Petri nets equipped with rewards and with a control mechanism based on the deactivation of controllable transitions. Such nets can be translated into Markov decision processes (MDPs), potentially leading to a combinatorial explosion in the number of states due to concurrency. Hence we restrict ourselves to instances where the nets are safe, free-choice and acyclic (SAFC nets) or even occurrence nets, and where policies are defined by a constant deactivation pattern. For such cases we obtain complexity-theoretic results via a close connection to Bayesian networks; in particular, we show that for SAFC nets the question of whether there is a policy guaranteeing a reward above a given threshold is $\mathsf{NP}^\mathsf{PP}$-complete. We also introduce a partial-order procedure that uses an SMT solver to address this problem.