Our work is motivated by environmental monitoring tasks, where finding the global maxima (i.e., hotspot) of a spatially varying field is crucial. We investigate the problem of identifying the hotspot for fields that can be sensed using an Unmanned Aerial Vehicle (UAV) equipped with a downward-facing camera. The UAV has a limited time budget which it can use for learning the unknown field and identifying the hotspot. Our contribution is to show how this problem can be formulated as a novel multi-fidelity variant of the Gaussian Process (GP) multi-armed bandit problem. The novelty is two-fold: (i) unlike standard multi-armed bandit settings, the rewards of the arms are correlated with each other; and (ii) unlike standard GP regression, the measurements in our problem are images (i.e., vector measurements) whose quality depends on the altitude of the UAV. We present a strategy for finding the sequence of UAV sensing locations and empirically compare it with several baselines. Experimental results using images gathered onboard a UAV are also presented and the scalability of the proposed methodology is assessed in a large-scale simulated environment in Gazebo.
翻译:我们的工作是由环境监测任务推动的,在环境监测任务中,找到一个空间差异的地块的全球最高点(即热点)至关重要。我们调查了如何为使用配备向下向下摄像机的无人驾驶航空飞行器(UAV)能够感知的田地确定热点的问题。无人驾驶航空飞行器有有限的时间预算,可用于学习未知田地和确定热点。我们的贡献是表明如何将这一问题发展成高山进程(GP)多臂土匪问题的新颖的多侧面变体。新颖的有两重:(一) 不同于标准的多臂土匪环境,武器奖赏是相互关联的;(二) 与标准的GP回归不同,我们问题的测量标准是图像(即矢量测量),其质量取决于无人驾驶飞行器的高度。我们提出了一个战略,以寻找无人驾驶飞行器感测地点的序列,并将它与几个基线进行实验性比较。在UAVAVE上收集的图像的实验结果也是在大规模环境中进行的模拟评估。