Robotic exploration of unknown environments is fundamentally a problem of decision making under uncertainty where the robot must account for uncertainty in sensor measurements, localization, action execution, as well as many other factors. For large-scale exploration applications, autonomous systems must overcome the challenges of sequentially deciding which areas of the environment are valuable to explore while safely evaluating the risks associated with obstacles and hazardous terrain. In this work, we propose a risk-aware meta-level decision making framework to balance the tradeoffs associated with local and global exploration. Meta-level decision making builds upon classical hierarchical coverage planners by switching between local and global policies with the overall objective of selecting the policy that is most likely to maximize reward in a stochastic environment. We use information about the environment history, traversability risk, and kinodynamic constraints to reason about the probability of successful policy execution to switch between local and global policies. We have validated our solution in both simulation and on a variety of large-scale real world hardware tests. Our results show that by balancing local and global exploration we are able to significantly explore large-scale environments more efficiently.
翻译:在不确定的情况下,机器人必须对传感器测量、地方化、行动执行以及许多其他因素的不确定性进行解释。对于大规模勘探应用,自主系统必须克服以下挑战:在安全评估与障碍和危险地形相关的风险的同时,按顺序决定哪些环境领域是有价值的,以进行探险;在这项工作中,我们提议了一个具有风险意识的元级决策框架,以平衡与地方和全球勘探有关的权衡。元级决策建立在传统的等级覆盖规划者的基础上,在本地和全球政策之间进行转换,总体目标是选择最有可能在随机环境中获得最大收益的政策。我们利用有关环境历史、可移植风险和动力学限制的信息,以说明成功执行政策的可能性,从而改变地方和全球政策。我们已在模拟和各种大规模实际世界硬件测试中确认了我们的解决方案。我们的结果表明,通过平衡地方和全球的探索,我们能够更高效地大规模地探索环境。