Climate-induced disasters are and will continue to be on the rise, and thus search-and-rescue (SAR) operations, where the task is to localize and assist one or several people who are missing, become increasingly relevant. In many cases the rough location may be known and a UAV can be deployed to explore a given, confined area to precisely localize the missing people. Due to time and battery constraints it is often critical that localization is performed as efficiently as possible. In this work we approach this type of problem by abstracting it as an aerial view goal localization task in a framework that emulates a SAR-like setup without requiring access to actual UAVs. In this framework, an agent operates on top of an aerial image (proxy for a search area) and is tasked with localizing a goal that is described in terms of visual cues. To further mimic the situation on an actual UAV, the agent is not able to observe the search area in its entirety, not even at low resolution, and thus it has to operate solely based on partial glimpses when navigating towards the goal. To tackle this task, we propose AiRLoc, a reinforcement learning (RL)-based model that decouples exploration (searching for distant goals) and exploitation (localizing nearby goals). Extensive evaluations show that AiRLoc outperforms heuristic search methods as well as alternative learnable approaches, and that it generalizes across datasets, e.g. to disaster-hit areas without seeing a single disaster scenario during training. We also conduct a proof-of-concept study which indicates that the learnable methods outperform humans on average. Code and models have been made publicly available at https://github.com/aleksispi/airloc.
翻译:气候引发的灾害正在并且将继续上升,因此搜索和救援(SAR)行动越来越具有相关性。在许多情况下,可能会知道粗糙的位置,无人驾驶航空器可以被部署来探索某个特定地区,但仅限于准确确定失踪人员的本地化。由于时间和电池的限制,通常至关重要的是尽可能高效地进行本地化工作。在这项工作中,我们通过将这类问题作为空中视图目标定位任务来处理,在类似SAR的设置中,在不需要访问实际的UAV区域的情况下,将之作为航空目标定位任务。在这个框架中,一个代理在空中图像的顶端操作(搜索区域的代理),并可以部署无人驾驶飞行器来探索某个特定地区。由于时间和电池的限制,该代理无法尽可能高效地进行本地化工作,甚至无法在可识别的地面上观察搜索区域,因此,它只能以部分直观的方式在类似SAR的平台上运行,而无需访问实际的UAAV区域。为了完成这一任务,我们建议AiLLA的深度的探索方法, 并显示在远方的搜索方法上学习一个目标。</s>