Aerial pixel-wise scene perception of the surrounding environment is an important task for UAVs (Unmanned Aerial Vehicles). Previous research works mainly adopt conventional pinhole cameras or fisheye cameras as the imaging device. However, these imaging systems cannot achieve large Field of View (FoV), small size, and lightweight at the same time. To this end, we design a UAV system with a Panoramic Annular Lens (PAL), which has the characteristics of small size, low weight, and a 360-degree annular FoV. A lightweight panoramic annular semantic segmentation neural network model is designed to achieve high-accuracy and real-time scene parsing. In addition, we present the first drone-perspective panoramic scene segmentation dataset Aerial-PASS, with annotated labels of track, field, and others. A comprehensive variety of experiments shows that the designed system performs satisfactorily in aerial panoramic scene parsing. In particular, our proposed model strikes an excellent trade-off between segmentation performance and inference speed suitable, validated on both public street-scene and our established aerial-scene datasets.
翻译:对周围环境的空中象素感知是无人驾驶飞行器(无人驾驶飞行器)的一项重要任务。以往的研究工作主要采用传统针孔照相机或鱼眼照相机作为成像装置。然而,这些成像系统不能同时实现大视野(FoV)、小尺寸和轻量量值。为此,我们设计了一个具有小尺寸、低重量和360度废气FOV等特征的无人驾驶飞行器系统。一个轻量的全方位废气断裂式神经网络模型的设计是为了实现高精确度和实时场景分解。此外,我们还展示了第一个无人驾驶-透视全景场分解数据集Air-PASS,并配有轨道、场等附加注释的标签。各种全面实验显示,设计系统在空中全景场的分解中表现令人满意。特别是,我们提议的模型在分解性性能和推导速度适当的情况下,在公共街道和空中固定的数据上都进行了验证。