Image datasets have been steadily growing in size, harming the feasibility and efficiency of large-scale 3D reconstruction methods. In this paper, a novel approach for scaling Multi-View Stereo (MVS) algorithms up to arbitrarily large collections of images is proposed. Specifically, the problem of reconstructing the 3D model of an entire city is targeted, starting from a set of videos acquired by a moving vehicle equipped with several high-resolution cameras. Initially, the presented method exploits an approximately uniform distribution of poses and geometry and builds a set of overlapping clusters. Then, an Integer Linear Programming (ILP) problem is formulated for each cluster to select an optimal subset of views that guarantees both visibility and matchability. Finally, local point clouds for each cluster are separately computed and merged. Since clustering is independent from pairwise visibility information, the proposed algorithm runs faster than existing literature and allows for a massive parallelization. Extensive testing on urban data are discussed to show the effectiveness and the scalability of this approach.
翻译:图像数据集在规模上稳步增长,损害了大规模三维重建方法的可行性和效率。本文提出了将多视角立体算法提升到任意大量收集图像的新办法。具体地说,重建整个城市的三维模型的问题,从装有几部高分辨率相机的移动车辆获得的一组视频开始,从一组视频开始。最初,所介绍的方法利用了大致统一的配置和几何分布,并构建了一组重叠的集群。随后,为每个组制定了一个Intger线性编程问题,以选择一组最佳的观点,既保证可见性,又保证匹配性。最后,每个组的局部点云是单独计算和合并的。由于组合独立于双向可见信息,拟议的算法比现有文献运行得更快,并允许大规模平行化。对城市数据进行了广泛的测试,以显示这种方法的有效性和可扩展性。