We study a practical yet hasn't been explored problem: how a drone can perceive in an environment from viewpoints of different flight heights. Unlike autonomous driving where the perception is always conducted from a ground viewpoint, a flying drone may flexibly change its flight height due to specific tasks, requiring capability for viewpoint invariant perception. To reduce the effort of annotation of flight data, we consider a ground-to-aerial knowledge distillation method while using only labeled data of ground viewpoint and unlabeled data of flying viewpoints. To this end, we propose a progressive semi-supervised learning framework which has four core components: a dense viewpoint sampling strategy that splits the range of vertical flight height into a set of small pieces with evenly-distributed intervals, and at each height we sample data from that viewpoint; the nearest neighbor pseudo-labeling that infers labels of the nearest neighbor viewpoint with a model learned on the preceding viewpoint; MixView that generates augmented images among different viewpoints to alleviate viewpoint difference; and a progressive distillation strategy to gradually learn until reaching the maximum flying height. We collect a synthesized dataset and a real-world dataset, and we perform extensive experiments to show that our method yields promising results for different flight heights.
翻译:我们研究了一个实际的,但还没有被探索过的问题:无人机如何从不同飞行高度的角度来看待一个环境中的不同飞行高度。与自发驾驶不同,在自发驾驶中,感知总是从地面角度进行,飞行无人机可能灵活地改变飞行高度,因为具体任务要求有辨别观点的能力。为了减少飞行数据注解的努力,我们考虑一种地对空知识蒸馏方法,同时只使用贴标签的地面视角数据和飞行视角的未贴标签数据。为此,我们提议了一个渐进的半监督学习框架,它有四个核心组成部分:密集观点取样战略,将垂直飞行高度的范围分成一组小块,间隔均衡,每次高度我们从这个角度抽样数据;近邻的假标签,将附近邻居观点的标签与从前一个角度学到的模型相近;MixViver,在不同观点中生成增强的图像,以缓解观点的差异;以及渐进的蒸馏战略,以逐步学习,直至达到最高飞行高度。我们收集了综合的数据集和真实的飞行结果,以显示我们有希望的飞行结果。