Unmanned aerial vehicles assist in maritime search and rescue missions by flying over large search areas to autonomously search for objects or people. Reliably detecting objects of interest requires fast models to employ on embedded hardware. Moreover, with increasing distance to the ground station only part of the video data can be transmitted. In this work, we consider the problem of finding meaningful region of interest proposals in a video stream on an embedded GPU. Current object or anomaly detectors are not suitable due to their slow speed, especially on limited hardware and for large image resolutions. Lastly, objects of interest, such as pieces of wreckage, are often not known a priori. Therefore, we propose an end-to-end future frame prediction model running in real-time on embedded GPUs to generate region proposals. We analyze its performance on large-scale maritime data sets and demonstrate its benefits over traditional and modern methods.
翻译:无人驾驶航空飞行器通过飞越大搜索区自主搜索物体或人员,协助海上搜索和救援任务。可靠的探测对象要求对嵌入硬件使用快速模型。此外,随着与地面站的距离越来越远,只能传送部分视频数据。在这项工作中,我们考虑在嵌入的GPU的视频流中找到有意义的感兴趣区域建议的问题。当前天体或异常探测器由于速度缓慢,特别是硬件有限和图像分辨率大,不适合进行海上搜索和救援。最后,像残骸碎片这样的对象往往不为人所知。因此,我们提议在嵌入的GPUS上实时运行一个端到端的未来框架预测模型,以产生区域建议。我们分析其在大型海洋数据集上的性能,并展示其对传统和现代方法的好处。