Markov 决策过程轨迹的非正式预测间隔 (Conformal Prediction Intervals for Markov Decision Process Trajectories) - 专知论文

会员服务 ·

0

Conformer · Markov · Processing（编程语言） · 泛函 · 控制器 ·

2022 年 6 月 10 日

Conformal Prediction Intervals for Markov Decision Process Trajectories

翻译：Markov 决策过程轨迹的非正式预测间隔

Thomas G. Dietterich,Jesse Hostetler

from arxiv, 25 pages, 15 figures, 2 tables

Before delegating a task to an autonomous system, a human operator may want a guarantee about the behavior of the system. This paper extends previous work on conformal prediction for functional data and conformalized quantile regression to provide conformal prediction intervals over the future behavior of an autonomous system executing a fixed control policy on a Markov Decision Process (MDP). The prediction intervals are constructed by applying conformal corrections to prediction intervals computed by quantile regression. The resulting intervals guarantee that with probability $1-\delta$ the observed trajectory will lie inside the prediction interval, where the probability is computed with respect to the starting state distribution and the stochasticity of the MDP. The method is illustrated on MDPs for invasive species management and StarCraft2 battles.

翻译：在将一项任务委托给一个自主系统之前,人类操作者可能需要对该系统的行为提供保证。本文件扩展了以前关于功能数据和符合性四分位回归的一致预测工作,以提供对一个自主系统未来行为进行一致预测的间隔,该自主系统对Markov决定程序(MDP)实施固定控制政策。预测间隔是通过对以四分位回归计算的预测间隔进行一致校正来构建的。由此得出的间隔保证,以1美元-德尔塔元的概率,观察到的轨道将位于预测间隔内,在此间隔内,对MDP的起始状态分布和随机性进行计算。该方法在入侵物种管理和StarCraft2战斗的 MDPs上作了说明。

0

相关内容

Conformer

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Call for Nominations: 2022 Multimedia Prize Paper Award

Call for Nominations: 2022 Multimedia Prize Paper Award

CCF多媒体专委会

0+阅读 · 2022年2月12日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

PP2Cδ调控的线粒体ROS通路在肺损伤和炎症中的作用机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

Aubry-Mather理论在弱光滑平面微分系统中的应用

国家自然科学基金

0+阅读 · 2014年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

Kronheimer-Nakajima quiver 模空间与有理曲面

国家自然科学基金

1+阅读 · 2013年12月31日

YAP2在神经祖细胞增殖维持和分化中的作用及分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

Evaluation of creating scoring opportunities for teammates in soccer via trajectory prediction

Arxiv

0+阅读 · 2022年7月27日

Motion Planning in Dynamic Environments Using Context-Aware Human Trajectory Prediction

Arxiv

1+阅读 · 2022年7月26日

On the Interaction between Test-Suite Reduction and Regression-Test Selection Strategies

Arxiv

0+阅读 · 2022年7月26日

Statistical and Computational Trade-offs in Variational Inference: A Case Study in Inferential Model Selection

Statistical and Computational Trade-offs in Variational Inference: A Case Study in Inferential Model Selection

Arxiv

0+阅读 · 2022年7月22日

Optimal Model Averaging of Support Vector Machines in Diverging Model Spaces

Arxiv

0+阅读 · 2022年7月22日

VIP会员

文章信息

相关主题

Processing（编程语言）

相关VIP内容

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【书籍】从零开始构建文本生成图像生成器：基于 Transformers 与扩散模型

人工智能与未来指挥

【伯克利博士论文】将大语言模型绑定至虚拟人格：实现人类行为模拟

稀疏自编码器综述：解释大语言模型的内部机制

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Call for Nominations: 2022 Multimedia Prize Paper Award

Call for Nominations: 2022 Multimedia Prize Paper Award

CCF多媒体专委会

0+阅读 · 2022年2月12日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

相关论文

Evaluation of creating scoring opportunities for teammates in soccer via trajectory prediction

Arxiv

0+阅读 · 2022年7月27日

Motion Planning in Dynamic Environments Using Context-Aware Human Trajectory Prediction

Arxiv

1+阅读 · 2022年7月26日

On the Interaction between Test-Suite Reduction and Regression-Test Selection Strategies

Arxiv

0+阅读 · 2022年7月26日

Statistical and Computational Trade-offs in Variational Inference: A Case Study in Inferential Model Selection

Statistical and Computational Trade-offs in Variational Inference: A Case Study in Inferential Model Selection

Arxiv

0+阅读 · 2022年7月22日

Optimal Model Averaging of Support Vector Machines in Diverging Model Spaces

Arxiv

0+阅读 · 2022年7月22日

相关基金

PP2Cδ调控的线粒体ROS通路在肺损伤和炎症中的作用机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

Aubry-Mather理论在弱光滑平面微分系统中的应用

国家自然科学基金

0+阅读 · 2014年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

Kronheimer-Nakajima quiver 模空间与有理曲面

国家自然科学基金

1+阅读 · 2013年12月31日

YAP2在神经祖细胞增殖维持和分化中的作用及分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

微信扫码咨询专知VIP会员