单一查看的3D重建网络中“重建比重识别”的数据集分布视角 (A Dataset-Dispersion Perspective on Reconstruction Versus Recognition in Single-View 3D Reconstruction Networks)

Neural networks (NN) for single-view 3D reconstruction (SVR) have gained in popularity. Recent work points out that for SVR, most cutting-edge NNs have limited performance on reconstructing unseen objects because they rely primarily on recognition (i.e., classification-based methods) rather than shape reconstruction. To understand this issue in depth, we provide a systematic study on when and why NNs prefer recognition to reconstruction and vice versa. Our finding shows that a leading factor in determining recognition versus reconstruction is how dispersed the training data is. Thus, we introduce the dispersion score, a new data-driven metric, to quantify this leading factor and study its effect on NNs. We hypothesize that NNs are biased toward recognition when training images are more dispersed and training shapes are less dispersed. Our hypothesis is supported and the dispersion score is proved effective through our experiments on synthetic and benchmark datasets. We show that the proposed metric is a principal way to analyze reconstruction quality and provides novel information in addition to the conventional reconstruction score.

翻译：用于单一视角的3D重建(SVR)的神经网络(NN)越来越受欢迎。最近的工作指出,对于SVR来说,大多数尖端的NNP在重建无形物体方面表现有限,因为它们主要依靠承认(即基于分类的方法)而不是形状重建。为了深入了解这一问题,我们提供了系统研究,说明NNP在何时和为什么更倾向于承认重建,反之亦然。我们的调查结果表明,确定承认与重建之间的一个主导因素是培训数据是如何分散的。因此,我们引入了分散评分,即新的数据驱动计量,以量化这一主导因素,并研究其对NNP的影响。我们假设,当培训图像更加分散,培训形状不那么分散时,NNP偏重于承认。我们的假设得到支持,分散评分通过我们关于合成和基准数据集的实验证明有效。我们表明,拟议的指标是分析重建质量的主要方法,除了常规重建评分之外,还提供新的信息。

相关内容

三维重建

关注 1173

在计算机视觉中, 三维重建是指根据单视图或者多视图的图像重建三维信息的过程. 由于单视频的信息不完全,因此三维重建需要利用经验知识. 而多视图的三维重建(类似人的双目定位)相对比较容易, 其方法是先对摄像机进行标定, 即计算出摄像机的图象坐标系与世界坐标系的关系.然后利用多个二维图象中的信息重建出三维信息。物体三维重建是计算机辅助几何设计(CAGD)、计算机图形学(CG)、计算机动画、计算机视觉、医学图像处理、科学计算和虚拟现实、数字媒体创作等领域的共性科学问题和核心技术。在计算机内生成物体三维表示主要有两类方法。一类是使用几何建模软件通过人机交互生成人为控制下的物体三维几何模型,另一类是通过一定的手段获取真实物体的几何形状。前者实现技术已经十分成熟,现有若干软件支持,比如:3DMAX、Maya、AutoCAD、UG等等,它们一般使用具有数学表达式的曲线曲面表示几何形状。后者一般称为三维重建过程,三维重建是指利用二维投影恢复物体三维信息(形状等)的数学过程和计算机技术,包括数据获取、预处理、点云拼接和特征分析等步骤。