动态方案规划,信息不完整,以克服航海环境中航行不确定性 (Dynamic programming with incomplete information to overcome navigational uncertainty in a nautical environment) - 专知论文

会员服务 ·

0

dynamic programming · Markov · INFORMS · 回合 · 部分可观测马尔可夫决策过程 ·

2022 年 7 月 19 日

Dynamic programming with incomplete information to overcome navigational uncertainty in a nautical environment

翻译：动态方案规划,信息不完整,以克服航海环境中航行不确定性

Chris Beeler,Xinkai Li,Colin Bellinger,Mark Crowley,Maia Fraser,Isaac Tamblyn

from arxiv, 11 pages, 5 figures

Using a novel toy nautical navigation environment, we show that dynamic programming can be used when only incomplete information about a partially observed Markov decision process (POMDP) is known. By incorporating uncertainty into our model, we show that navigation policies can be constructed that maintain safety, outperforming the baseline performance of traditional dynamic programming for Markov decision processes (MDPs). Adding in controlled sensing methods, we show that these policies can also lower measurement costs at the same time.

翻译：使用新型的玩具航行环境,我们证明,当只知道关于部分观察到的Markov决策过程(POMDP)的不完整信息时,就可以使用动态编程。通过将不确定性纳入我们的模型,我们证明可以制定维护安全的导航政策,超过Markov决策过程(MDPs)传统动态编程(MDPs)的基线性能。在控制遥感方法中,我们显示这些政策也可以同时降低测量成本。

0

相关内容

dynamic programming

dynamic programming

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

【硬核课】机器人学习课程，UT Austin朱玉可博士讲述自主机器人的人工智能与机器学习机器学习算法

【硬核课】机器人学习课程，UT Austin朱玉可博士讲述自主机器人的人工智能与机器学习机器学习算法

专知会员服务

40+阅读 · 2020年9月21日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

专知会员服务

60+阅读 · 2020年3月14日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

Cidea和Fsp27蛋白调控机体脂代谢的功能研究

国家自然科学基金

0+阅读 · 2017年12月31日

细菌角蛋白酶KerF降解角蛋白过程与分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

Progerin/PrelaminA诱发早老症的蛋白质组学研究

国家自然科学基金

1+阅读 · 2015年12月31日

罗巴代数的表示和罗巴代数在operad中的应用

国家自然科学基金

0+阅读 · 2015年12月31日

四种肉蛋白摄入对大鼠肠道微生态和肝脏脂质代谢的影响

国家自然科学基金

0+阅读 · 2014年12月31日

重金属离子胁迫下花斑裸鲤钙调蛋白磷酸酶(Calcineurin)的应答及其分子调节机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

低维量子系统中两体与多体量子关联指示相变的研究

国家自然科学基金

0+阅读 · 2012年12月31日

uPAR在慢性阻塞性肺疾病小气道上皮细胞EMT中作用的研究

国家自然科学基金

0+阅读 · 2012年12月31日

Cocycle动力学和拟周期薛定谔算子的谱

国家自然科学基金

0+阅读 · 2012年12月31日

Q型烟粉虱RNAi过程中dsRNA吸收途径研究

国家自然科学基金

0+阅读 · 2012年12月31日

VINet: Visual and Inertial-based Terrain Classification and Adaptive Navigation over Unknown Terrain

Arxiv

0+阅读 · 2022年9月16日

Overcoming Exploration: Deep Reinforcement Learning in Complex Environments from Temporal Logic Specifications

Arxiv

0+阅读 · 2022年9月16日

Sequential Bayesian Optimization for Adaptive Informative Path Planning with Multimodal Sensing

Arxiv

0+阅读 · 2022年9月16日

Efficient learning of nonlinear prediction models with time-series privileged information

Arxiv

0+阅读 · 2022年9月15日

Applying Back Propagation Algorithm and Analytic Hierarchy Process to Environment Assessment

Arxiv

0+阅读 · 2022年9月15日

Robust Safe Control for Uncertain Dynamic Models

Arxiv

0+阅读 · 2022年9月14日

RASR: Risk-Averse Soft-Robust MDPs with EVaR and Entropic Risk

Arxiv

0+阅读 · 2022年9月14日

Multi-robot Mission Planning in Dynamic Semantic Environments

Arxiv

1+阅读 · 2022年9月13日

Multi-Agent Simulation for AI Behaviour Discovery in Operations Research

Arxiv

39+阅读 · 2021年8月30日

A Survey of Uncertainty in Deep Neural Networks

Arxiv

30+阅读 · 2021年7月7日

VIP会员

文章信息

相关主题

dynamic programming

部分可观测马尔可夫决策过程

相关VIP内容

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

【硬核课】机器人学习课程，UT Austin朱玉可博士讲述自主机器人的人工智能与机器学习机器学习算法

【硬核课】机器人学习课程，UT Austin朱玉可博士讲述自主机器人的人工智能与机器学习机器学习算法

专知会员服务

40+阅读 · 2020年9月21日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

专知会员服务

60+阅读 · 2020年3月14日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【伯克利博士论文】通过真实世界实践赋能机器人自主性

军用无人机集群技术尚未成熟——但潜力可期

人工智能安全治理白皮书（2025）

AgentOps综述：分类、挑战与未来方向

相关资讯

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

相关论文

VINet: Visual and Inertial-based Terrain Classification and Adaptive Navigation over Unknown Terrain

Arxiv

0+阅读 · 2022年9月16日

Overcoming Exploration: Deep Reinforcement Learning in Complex Environments from Temporal Logic Specifications

Arxiv

0+阅读 · 2022年9月16日

Sequential Bayesian Optimization for Adaptive Informative Path Planning with Multimodal Sensing

Arxiv

0+阅读 · 2022年9月16日

Efficient learning of nonlinear prediction models with time-series privileged information

Arxiv

0+阅读 · 2022年9月15日

Applying Back Propagation Algorithm and Analytic Hierarchy Process to Environment Assessment

Arxiv

0+阅读 · 2022年9月15日

Robust Safe Control for Uncertain Dynamic Models

Arxiv

0+阅读 · 2022年9月14日

RASR: Risk-Averse Soft-Robust MDPs with EVaR and Entropic Risk

Arxiv

0+阅读 · 2022年9月14日

Multi-robot Mission Planning in Dynamic Semantic Environments

Arxiv

1+阅读 · 2022年9月13日

Multi-Agent Simulation for AI Behaviour Discovery in Operations Research

Arxiv

39+阅读 · 2021年8月30日

A Survey of Uncertainty in Deep Neural Networks

Arxiv

30+阅读 · 2021年7月7日

相关基金

Cidea和Fsp27蛋白调控机体脂代谢的功能研究

国家自然科学基金

0+阅读 · 2017年12月31日

细菌角蛋白酶KerF降解角蛋白过程与分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

Progerin/PrelaminA诱发早老症的蛋白质组学研究

国家自然科学基金

1+阅读 · 2015年12月31日

罗巴代数的表示和罗巴代数在operad中的应用

国家自然科学基金

0+阅读 · 2015年12月31日

四种肉蛋白摄入对大鼠肠道微生态和肝脏脂质代谢的影响

国家自然科学基金

0+阅读 · 2014年12月31日

重金属离子胁迫下花斑裸鲤钙调蛋白磷酸酶(Calcineurin)的应答及其分子调节机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

低维量子系统中两体与多体量子关联指示相变的研究

国家自然科学基金

0+阅读 · 2012年12月31日

uPAR在慢性阻塞性肺疾病小气道上皮细胞EMT中作用的研究

国家自然科学基金

0+阅读 · 2012年12月31日

Cocycle动力学和拟周期薛定谔算子的谱

国家自然科学基金

0+阅读 · 2012年12月31日

Q型烟粉虱RNAi过程中dsRNA吸收途径研究

国家自然科学基金

0+阅读 · 2012年12月31日

微信扫码咨询专知VIP会员