Estimating the pose of an object from a monocular image is an inverse problem fundamental in computer vision. The ill-posed nature of this problem requires incorporating deformation priors to solve it. In practice, many materials do not perceptibly shrink or extend when manipulated, constituting a powerful and well-known prior. Mathematically, this translates to the preservation of the Riemannian metric. Neural networks offer the perfect playground to solve the surface reconstruction problem as they can approximate surfaces with arbitrary precision and allow the computation of differential geometry quantities. This paper presents an approach to inferring continuous deformable surfaces from a sequence of images, which is benchmarked against several techniques and obtains state-of-the-art performance without the need for offline training.
翻译:从单眼图像中估计物体的姿态是计算机视觉中的基本反问题。这个问题的不适定性要求我们在解决它时加入形变先验。在实践中,许多材料在操纵时并不会明显收缩或扩展,这构成了一个强大且广为人知的先验。数学上,这意味着保持黎曼度量。神经网络提供了解决表面重建问题的完美场所,因为它们可以用任意精度近似表面,并允许计算微分几何量。本文提出了一种从图像序列推断连续可变形表面的方法,对该方法进行了多种技术的基准测试,取得了最先进的性能,而无需离线训练。