Estimating the pose of an object from a monocular image is an inverse problem fundamental in computer vision. The ill-posed nature of this problem requires incorporating deformation priors to solve it. In practice, many materials do not perceptibly shrink or extend when manipulated, constituting a powerful and well-known prior. Mathematically, this translates to the preservation of the Riemannian metric. Neural networks offer the perfect playground to solve the surface reconstruction problem as they can approximate surfaces with arbitrary precision and allow the computation of differential geometry quantities. This paper presents an approach to inferring continuous deformable surfaces from a sequence of images, which is benchmarked against several techniques and obtains state-of-the-art performance without the need for offline training.
翻译:以单镜图像来估计物体的外形是计算机视觉中一个根本的反面问题。这个问题的错误性质要求纳入变形前科来解决这个问题。实际上,许多材料在被操纵时不会明显缩缩或延伸,这构成了一个强大和众所周知的先行。从数学角度讲,这转化为保存里曼尼度量度。神经网络为解决地表重建问题提供了完美的操场,因为它们可以任意精确地接近表面,并可以计算不同的几何数量。本文提出了一个方法,从一系列图像中推断出连续变形的表面,这些图像以几种技术为基准,在不需要离线培训的情况下获得最先进的性能。