Climate-induced disasters are and will continue to be on the rise, and thus search-and-rescue (SAR) operations, where the task is to localize and assist one or several people who are missing, become increasingly relevant. In many cases the rough location may be known and a UAV can be deployed to explore a given, confined area to precisely localize the missing people. Due to time and battery constraints it is often critical that localization is performed as efficiently as possible. In this work we approach this type of problem by abstracting it as an aerial view goal localization task in a framework that emulates a SAR-like setup without requiring access to actual UAVs. In this framework, an agent operates on top of an aerial image (proxy for a search area) and is tasked with localizing a goal that is described in terms of visual cues. To further mimic the situation on an actual UAV, the agent is not able to observe the search area in its entirety, not even at low resolution, and thus it has to operate solely based on partial glimpses when navigating towards the goal. To tackle this task, we propose AiRLoc, a reinforcement learning (RL)-based model that decouples exploration (searching for distant goals) and exploitation (localizing nearby goals). Extensive evaluations show that AiRLoc outperforms heuristic search methods as well as alternative learnable approaches, and that it generalizes across datasets, e.g. to disaster-hit areas without seeing a single disaster scenario during training. We also conduct a proof-of-concept study which indicates that the learnable methods outperform humans on average. Code and models have been made publicly available at https://github.com/aleksispi/airloc.
翻译:气候引发的灾害正在并且将继续上升,因此搜索和救援(SAR)行动越来越具有相关性。许多情况下,可能知道粗糙的地点,无人驾驶航空器可以被部署来探索某个特定地区,但仅限于准确确定失踪人员的本地化。由于时间和电池的限制,通常至关重要的是,如何尽可能高效地进行本地化。在这项工作中,我们通过将这类问题作为一个空中视图目标定位任务来处理,在类似SAR的设置的框架中,在不需要访问实际的UAV区域的情况下,将它作为航空目标定位任务。在这个框架中,一个代理在空中图像的顶端操作(搜索区域的代理),并可以部署无人驾驶飞行器来探索某个特定地区。由于时间和电池的限制,该代理无法尽可能有效地进行本地化工作,甚至无法在可以低清晰度的情况下,因此,它必须完全以部分直观的方式运行。为了应对这一任务,我们建议AiLLL的深度搜索方法,在进行远程搜索时,我们提议在进行远程搜索时要展示一个目标,以学习一个目标,在远程搜索方法上进行。