Formally Verified Solution Methods for Infinite-Horizon Markov Decision Processes - Zhuanzhi (专知) Papers

June 5, 2022

Formally Verified Solution Methods for Infinite-Horizon Markov Decision Processes

Maximilian Schäffeler, Mohammad Abdulaziz

We formally verify executable algorithms for solving Markov decision processes (MDPs) in the interactive theorem prover Isabelle/HOL. We build on existing formalizations of probability theory to analyze the expected total reward criterion on infinite-horizon problems. Our developments formalize the Bellman equation and give conditions under which optimal policies exist. Based on this analysis, we verify dynamic programming algorithms to solve tabular MDPs. We evaluate the formally verified implementations experimentally on standard problems and show they are practical. Furthermore, we show that, combined with efficient unverified implementations, our system can compete with and even outperform state-of-the-art systems.
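The abstract names the Bellman equation and dynamic programming on tabular MDPs without showing either. As a rough illustration only, the sketch below runs plain value iteration on a tiny tabular MDP. It is unverified Python, not the authors' Isabelle/HOL development. The discount factor `gamma`, the stopping tolerance `eps`, the `transitions` encoding, and the two-state example are all assumptions introduced here for illustration.

```python
from typing import List, Tuple

# Hypothetical encoding (an assumption, not taken from the paper):
# transitions[s][a] is a list of (probability, next_state, reward) triples.
Transitions = List[List[List[Tuple[float, int, float]]]]


def q_value(outcomes: List[Tuple[float, int, float]],
            v: List[float], gamma: float) -> float:
    """Expected one-step reward plus discounted value of the successor."""
    return sum(p * (r + gamma * v[t]) for p, t, r in outcomes)


def value_iteration(transitions: Transitions, gamma: float = 0.9,
                    eps: float = 1e-8) -> Tuple[List[float], List[int]]:
    """Iterate the Bellman optimality operator until the update is below eps."""
    v = [0.0] * len(transitions)
    while True:
        # Bellman backup: best expected value over the actions of each state.
        new_v = [max(q_value(outcomes, v, gamma) for outcomes in actions)
                 for actions in transitions]
        delta = max(abs(a - b) for a, b in zip(new_v, v))
        v = new_v
        if delta < eps:
            break
    # Extract a policy that is greedy with respect to the final values.
    policy = [max(range(len(actions)),
                  key=lambda a: q_value(actions[a], v, gamma))
              for actions in transitions]
    return v, policy


# Assumed two-state example: in state 0, action 0 stays (reward 0) and
# action 1 moves to state 1 (reward 1); state 1 loops with reward 2.
example: Transitions = [
    [[(1.0, 0, 0.0)], [(1.0, 1, 1.0)]],
    [[(1.0, 1, 2.0)]],
]
values, policy = value_iteration(example, gamma=0.5)
```

With `gamma = 0.5`, the fixed point of this example can be checked by hand: state 1 is worth 2 / (1 - 0.5) = 4, and state 0 is worth 1 + 0.5 · 4 = 3 by moving to state 1, so the greedy policy picks action 1 in state 0.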
