VIBUS: Data-efficient 3D Scene Parsing with VIewpoint Bottleneck and Uncertainty-Spectrum Modeling

3D scene parsing with deep learning has recently become a hot topic. However, current fully-supervised methods require manually annotated point-wise supervision, which is laborious and time-consuming to obtain. Training 3D scene parsing models with sparse supervision is therefore an intriguing alternative. We term this task data-efficient 3D scene parsing and propose an effective two-stage framework named VIBUS that addresses it by exploiting the large number of unlabeled points. In the first stage, we perform self-supervised representation learning on unlabeled points with the proposed Viewpoint Bottleneck loss function. The loss function is derived from an information bottleneck objective imposed on scenes under different viewpoints, making the process of representation learning free of degradation and sampling. In the second stage, pseudo labels are harvested from the sparse labels based on uncertainty-spectrum modeling. By combining data-driven uncertainty measures and 3D mesh spectrum measures (derived from normal directions and geodesic distances), a robust local affinity metric is obtained. Finite gamma/beta mixture models are used to decompose category-wise distributions of these measures, leading to automatic selection of thresholds. We evaluate VIBUS on the public benchmark ScanNet and achieve state-of-the-art results on both the validation set and the online test server. Ablation studies show that both Viewpoint Bottleneck and uncertainty-spectrum modeling bring significant improvements. Code and models are publicly available at https://github.com/AIR-DISCOVER/VIBUS.
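
The abstract does not spell out the Viewpoint Bottleneck loss itself. As a rough illustration of what a sampling-free, two-view objective can look like, the sketch below assumes a redundancy-reduction form in the style of Barlow Twins: per-point features from two viewpoints of the same scene are standardized, their cross-correlation matrix is computed, and the loss pushes that matrix toward the identity. This is an assumption for illustration, not the authors' formulation; the function names and the weight `lam` are hypothetical.

```python
import math
import random


def standardize(feats, eps=1e-8):
    """Normalize each feature column to zero mean and unit variance over the points."""
    n, d = len(feats), len(feats[0])
    means = [sum(row[j] for row in feats) / n for j in range(d)]
    stds = [
        math.sqrt(sum((row[j] - means[j]) ** 2 for row in feats) / n) + eps
        for j in range(d)
    ]
    return [[(row[j] - means[j]) / stds[j] for j in range(d)] for row in feats]


def cross_correlation(za, zb):
    """D x D cross-correlation between standardized features of two views."""
    n, d = len(za), len(za[0])
    return [
        [sum(za[k][i] * zb[k][j] for k in range(n)) / n for j in range(d)]
        for i in range(d)
    ]


def two_view_loss_sketch(feats_a, feats_b, lam=0.005):
    """Assumed loss: drive the cross-correlation matrix toward the identity.

    feats_a, feats_b: N x D lists holding features of the same N points,
    observed under two different viewpoints (hypothetical input layout).
    lam: illustrative weight on the off-diagonal term, not a tuned value.
    """
    c = cross_correlation(standardize(feats_a), standardize(feats_b))
    d = len(c)
    on_diag = sum((1.0 - c[i][i]) ** 2 for i in range(d))
    off_diag = sum(c[i][j] ** 2 for i in range(d) for j in range(d) if i != j)
    return on_diag + lam * off_diag


if __name__ == "__main__":
    random.seed(0)
    view_a = [[random.gauss(0, 1) for _ in range(4)] for _ in range(200)]
    unrelated = [[random.gauss(0, 1) for _ in range(4)] for _ in range(200)]
    print("matching views:", two_view_loss_sketch(view_a, view_a))
    print("unrelated views:", two_view_loss_sketch(view_a, unrelated))
```

Because the objective only compares the two views of the same points, it needs no negative-pair sampling, and the diagonal term discourages the trivial constant solution; both properties are in the spirit of the "free of degradation and sampling" claim above, under the stated assumption.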

