Stereo matching is a fundamental task for 3D scene reconstruction. Recently, deep learning based methods have proven effective on some benchmark datasets, such as KITTI and Scene Flow. UAVs (Unmanned Aerial Vehicles) are commonly utilized for surface observation, and their captured images are frequently used for detailed 3D reconstruction due to high resolution and low-altitude acquisition. At present, the mainstream supervised learning network requires a significant amount of training data with ground-truth labels to learn model parameters. However, due to the scarcity of UAV stereo matching datasets, the learning-based network cannot be applied to UAV images. To facilitate further research, this paper proposes a novel pipeline to generate accurate and dense disparity maps using detailed meshes reconstructed by UAV images and LiDAR point clouds. Through the proposed pipeline, this paper constructs a multi-resolution UAV scenario dataset, called UAVStereo, with over 34k stereo image pairs covering 3 typical scenes. As far as we know, UAVStereo is the first stereo matching dataset of UAV low-altitude scenarios. The dataset includes synthetic and real stereo pairs to enable generalization from the synthetic domain to the real domain. Furthermore, our UAVStereo dataset provides multi-resolution and multi-scene images pairs to accommodate a variety of sensors and environments. In this paper, we evaluate traditional and state-of-the-art deep learning methods, highlighting their limitations in addressing challenges in UAV scenarios and offering suggestions for future research. The dataset is available at https://github.com/rebecca0011/UAVStereo.git
翻译: Stero 匹配是3D 场景重建的一项根本任务。 最近, 基于深层次学习的方法已证明在一些基准数据集,如 KITTI 和 Scene Flow 上是有效的。 地面观测通常使用无人驾驶航空飞行器(UAVs),由于高分辨率和低纬度获取,其捕获的图像经常用于详细的3D重建。 目前,主流监督的学习网络需要大量带有地面真相标签的培训数据,以学习模型参数。 然而,由于缺乏UAV的立体比对数据集,基于学习的网络无法应用于UAV图像。 为便利进一步研究,本文提出一个新的管道,利用由UAVAV图像和LDAR点云重建的详细缩略图生成准确而密集的差异图。 通过拟议的管道,本文构建了一个多分辨率的UAVSterio, 包括覆盖3个典型场景的34k以上立体立体立体图像配对。 据我们所知, UAVStero- Stepecial是用于UAV- 低纬度图像图像图像的首次立体匹配匹配匹配比对数据。 和常规空间域的合成SDSLSDSDSDSDSD 提供数据提供数据和真实数据, 和真实数据供应的合成和真实数据。