The automatic semantic segmentation of the huge amount of acquired remote sensing data has become an important task in the last decade. Images and Point Clouds (PCs) are fundamental data representations, particularly in urban mapping applications. Textured 3D meshes integrate both data representations geometrically by wiring the PC and texturing the surface elements with available imagery. We present a mesh-centered holistic geometry-driven methodology that explicitly integrates entities of imagery, PC and mesh. Due to its integrative character, we choose the mesh as the core representation that also helps to solve the visibility problem for points in imagery. Utilizing the proposed multi-modal fusion as the backbone and considering the established entity relationships, we enable the sharing of information across the modalities imagery, PC and mesh in a two-fold manner: (i) feature transfer and (ii) label transfer. By these means, we achieve to enrich feature vectors to multi-modal feature vectors for each representation. Concurrently, we achieve to label all representations consistently while reducing the manual label effort to a single representation. Consequently, we facilitate to train machine learning algorithms and to semantically segment any of these data representations - both in a multi-modal and single-modal sense. The paper presents the association mechanism and the subsequent information transfer, which we believe are cornerstones for multi-modal scene analysis. Furthermore, we discuss the preconditions and limitations of the presented approach in detail. We demonstrate the effectiveness of our methodology on the ISPRS 3D semantic labeling contest (Vaihingen 3D) and a proprietary data set (Hessigheim 3D).
翻译:图像和点云(PC)是基本的数据表达形式,特别是在城市绘图应用中。 Textured 3D meshes 将数据表达方式以几何方式结合,方法是将PC对齐,并将表层元素与现有图像相纹。我们展示了一种以网状为中心的整体几何驱动方法,明确将图像、PC和网目等实体纳入其中。由于其综合特性,我们选择了网路作为核心代表形式,这也有助于解决图像点的可见度问题。利用拟议的多模式融合作为主干线,并考虑到已经建立的实体关系。我们以两重的方式将数据表达方式、PC和网目中的数据表达方式结合起来:(一) 特征传输和(二) 标签传输。通过这些手段,我们实现了将特性矢量与每个显示的多模式特性矢量相融合。同时,我们实现了将所有表示方式标记一致,同时将手动标签的努力降低到单一代表形式。因此,我们为D 培训机机算的节能算法和结构分析, 将数据转换成一个系统模型和结构分析。