Recovery of an underlying scene geometry from multiview images stands as a long-time challenge in computer vision research. The recent promise leverages neural implicit surface learning and differentiable volume rendering, and achieves both the recovery of scene geometry and synthesis of novel views, where deep priors of neural models are used as an inductive smoothness bias. While promising for object-level surfaces, these methods suffer when coping with complex scene surfaces. In the meanwhile, traditional multi-view stereo can recover the geometry of scenes with rich textures, by globally optimizing the local, pixel-wise correspondences across multiple views. We are thus motivated to make use of the complementary benefits from the two strategies, and propose a method termed Helix-shaped neural implicit Surface learning or HelixSurf; HelixSurf uses the intermediate prediction from one strategy as the guidance to regularize the learning of the other one, and conducts such intertwined regularization iteratively during the learning process. We also propose an efficient scheme for differentiable volume rendering in HelixSurf. Experiments on surface reconstruction of indoor scenes show that our method compares favorably with existing methods and is orders of magnitude faster, even when some of existing methods are assisted with auxiliary training data. The source code is available at https://github.com/Gorilla-Lab-SCUT/HelixSurf.
翻译:从多视图图像中恢复基本场景几何是一个长期的计算机视觉研究挑战。 最近的预示利用神经隐含表面学习和不同体积的生成,实现了现场几何学和新观点合成的恢复,将神经模型的深层前科用作感应性平滑偏差。 这些方法虽然对目标层表面有希望,但在应对复杂场景表面时会受到影响。 与此同时,传统的多视图立体可以通过在全球范围内优化多种观点之间的本地像素-智慧通信,恢复具有丰富质素的场景几何学。 因此,我们积极利用两种战略的互补效益,并提出一种称为Helix-形神经隐含表面学习或HelixSurf的新观点; HeliixSurf 使用一种战略的中间预测作为指导,规范另一个表面表面表面表面表面表面表面表面的学习,并在学习过程中反复进行这种交错的规范化。 我们还提议了一个高效的系统图集,在HelixSurf中将不同的卷进行优化。 甚至对室内场景的表面重建实验显示,我们的方法与现有的辅助数据级系统相比,在现有的系统/辅助系统上具有更快的顺序。</s>