为在不确定性下勘探而作出具有风险意识的元级决定 (Risk-aware Meta-level Decision Making for Exploration Under Uncertainty) - 专知论文

会员服务 ·

0

回合 · INFORMS · 机器人 · 约束 · 极大 ·

2022 年 9 月 12 日

Risk-aware Meta-level Decision Making for Exploration Under Uncertainty

翻译：为在不确定性下勘探而作出具有风险意识的元级决定

Joshua Ott,Sung-Kyun Kim,Amanda Bouman,Oriana Peltzer,Mamoru Sobue,Harrison Delecki,Mykel J. Kochenderfer,Joel Burdick,Ali-akbar Agha-mohammadi

Robotic exploration of unknown environments is fundamentally a problem of decision making under uncertainty where the robot must account for uncertainty in sensor measurements, localization, action execution, as well as many other factors. For large-scale exploration applications, autonomous systems must overcome the challenges of sequentially deciding which areas of the environment are valuable to explore while safely evaluating the risks associated with obstacles and hazardous terrain. In this work, we propose a risk-aware meta-level decision making framework to balance the tradeoffs associated with local and global exploration. Meta-level decision making builds upon classical hierarchical coverage planners by switching between local and global policies with the overall objective of selecting the policy that is most likely to maximize reward in a stochastic environment. We use information about the environment history, traversability risk, and kinodynamic constraints to reason about the probability of successful policy execution to switch between local and global policies. We have validated our solution in both simulation and on a variety of large-scale real world hardware tests. Our results show that by balancing local and global exploration we are able to significantly explore large-scale environments more efficiently.

翻译：在不确定的情况下,机器人必须对传感器测量、地方化、行动执行以及许多其他因素的不确定性进行解释。对于大规模勘探应用,自主系统必须克服以下挑战:在安全评估与障碍和危险地形相关的风险的同时,按顺序决定哪些环境领域是有价值的,以进行探险;在这项工作中,我们提议了一个具有风险意识的元级决策框架,以平衡与地方和全球勘探有关的权衡。元级决策建立在传统的等级覆盖规划者的基础上,在本地和全球政策之间进行转换,总体目标是选择最有可能在随机环境中获得最大收益的政策。我们利用有关环境历史、可移植风险和动力学限制的信息,以说明成功执行政策的可能性,从而改变地方和全球政策。我们已在模拟和各种大规模实际世界硬件测试中确认了我们的解决方案。我们的结果表明,通过平衡地方和全球的探索,我们能够更高效地大规模地探索环境。

0

相关内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

机器学习组合优化

机器学习组合优化

专知会员服务

110+阅读 · 2021年2月16日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

80+阅读 · 2020年7月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

凋亡诱导因子AIF调控Wnt信号通路的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

碳交易、互惠偏好与供应链减排博弈研究

国家自然科学基金

1+阅读 · 2015年12月31日

长链非编码RNA CAR intergenic 10在细胞衰老中的作用和机制

国家自然科学基金

1+阅读 · 2013年12月31日

基于NF-κB信号通路研究vaspin与leptin在骨性关节炎中的拮抗作用及分子机制

国家自然科学基金

0+阅读 · 2013年12月31日

基于DMOC的复杂环境飞行器优化轨迹生成实时性能问题研究

国家自然科学基金

0+阅读 · 2012年12月31日

Reality-based Interaction用户界面模型和评估方法研究

国家自然科学基金

0+阅读 · 2011年12月31日

细胞衰老和SENEX基因对老年外周CD4+CD25+ Treg增强的影响

国家自然科学基金

0+阅读 · 2011年12月31日

DNA损伤诱导的p53非依赖性细胞凋亡途径- - -Bim途径

国家自然科学基金

0+阅读 · 2009年12月31日

工程项目可持续建设的广义失效机理分析与控制研究

国家自然科学基金

0+阅读 · 2009年12月31日

Plug-In混合动力汽车能量管理及动力系统优化问题研究

国家自然科学基金

1+阅读 · 2008年12月31日

Reliability-Aware Prediction via Uncertainty Learning for Person Image Retrieval

Arxiv

0+阅读 · 2022年10月24日

Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions

Arxiv

0+阅读 · 2022年10月24日

Robust Anytime Learning of Markov Decision Processes

Arxiv

0+阅读 · 2022年10月24日

Policy Optimization with Advantage Regularization for Long-Term Fairness in Decision Systems

Arxiv

0+阅读 · 2022年10月22日

Efficient Submodular Optimization under Noise: Local Search is Robust

Arxiv

0+阅读 · 2022年10月21日

Safe Policy Improvement in Constrained Markov Decision Processes

Arxiv

0+阅读 · 2022年10月20日

Data-Driven Distributionally Robust Electric Vehicle Balancing for Mobility-on-Demand Systems under Demand and Supply Uncertainties

Arxiv

0+阅读 · 2022年10月19日

A Survey of Decision Making in Adversarial Games

Arxiv

84+阅读 · 2022年7月16日

A Survey on Uncertainty Reasoning and Quantification for Decision Making: Belief Theory Meets Deep Learning

Arxiv

30+阅读 · 2022年6月12日

A Survey of Uncertainty in Deep Neural Networks

Arxiv

30+阅读 · 2021年7月7日

VIP会员

文章信息

相关主题

相关VIP内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

机器学习组合优化

机器学习组合优化

专知会员服务

110+阅读 · 2021年2月16日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

80+阅读 · 2020年7月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《基于大型语言模型的软件工程自动化研究》最新264页

《基于大型语言模型的信号处理管线研究：推进军事电子情报工作流程》最新76页

中文版 | 战争算法：生成式人工智能在战场的崛起

中文版《美国陆军：战术行为性远程医疗实施观察与建议》

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

Reliability-Aware Prediction via Uncertainty Learning for Person Image Retrieval

Arxiv

0+阅读 · 2022年10月24日

Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions

Arxiv

0+阅读 · 2022年10月24日

Robust Anytime Learning of Markov Decision Processes

Arxiv

0+阅读 · 2022年10月24日

Policy Optimization with Advantage Regularization for Long-Term Fairness in Decision Systems

Arxiv

0+阅读 · 2022年10月22日

Efficient Submodular Optimization under Noise: Local Search is Robust

Arxiv

0+阅读 · 2022年10月21日

Safe Policy Improvement in Constrained Markov Decision Processes

Arxiv

0+阅读 · 2022年10月20日

Data-Driven Distributionally Robust Electric Vehicle Balancing for Mobility-on-Demand Systems under Demand and Supply Uncertainties

Arxiv

0+阅读 · 2022年10月19日

A Survey of Decision Making in Adversarial Games

Arxiv

84+阅读 · 2022年7月16日

A Survey on Uncertainty Reasoning and Quantification for Decision Making: Belief Theory Meets Deep Learning

Arxiv

30+阅读 · 2022年6月12日

A Survey of Uncertainty in Deep Neural Networks

Arxiv

30+阅读 · 2021年7月7日

相关基金

凋亡诱导因子AIF调控Wnt信号通路的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

碳交易、互惠偏好与供应链减排博弈研究

国家自然科学基金

1+阅读 · 2015年12月31日

长链非编码RNA CAR intergenic 10在细胞衰老中的作用和机制

国家自然科学基金

1+阅读 · 2013年12月31日

基于NF-κB信号通路研究vaspin与leptin在骨性关节炎中的拮抗作用及分子机制

国家自然科学基金

0+阅读 · 2013年12月31日

基于DMOC的复杂环境飞行器优化轨迹生成实时性能问题研究

国家自然科学基金

0+阅读 · 2012年12月31日

Reality-based Interaction用户界面模型和评估方法研究

国家自然科学基金

0+阅读 · 2011年12月31日

细胞衰老和SENEX基因对老年外周CD4+CD25+ Treg增强的影响

国家自然科学基金

0+阅读 · 2011年12月31日

DNA损伤诱导的p53非依赖性细胞凋亡途径- - -Bim途径

国家自然科学基金

0+阅读 · 2009年12月31日

工程项目可持续建设的广义失效机理分析与控制研究

国家自然科学基金

0+阅读 · 2009年12月31日

Plug-In混合动力汽车能量管理及动力系统优化问题研究

国家自然科学基金

1+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员