直观-1-线性:一个椭圆-线性相片照相机粒子估计问题的拟订、分析和解决办法 (Perspective-1-Ellipsoid: Formulation, Analysis and Solutions of the Camera Pose Estimation Problem from One Ellipse-Ellipsoid Correspondence)

In computer vision, camera pose estimation from correspondences between 3D geometric entities and their projections into the image has been a widely investigated problem. Although most state-of-the-art methods exploit low-level primitives such as points or lines, the emergence of very effective CNN-based object detectors in the recent years has paved the way to the use of higher-level features carrying semantically meaningful information. Pioneering works in that direction have shown that modelling 3D objects by ellipsoids and 2D detections by ellipses offers a convenient manner to link 2D and 3D data. However, the mathematical formalism most often used in the related litterature does not enable to easily distinguish ellipsoids and ellipses from other quadrics and conics, leading to a loss of specificity potentially detrimental in some developments. Moreover, the linearization process of the projection equation creates an over-representation of the camera parameters, also possibly causing an efficiency loss. In this paper, we therefore introduce an ellipsoid-specific theoretical framework and demonstrate its beneficial properties in the context of pose estimation. More precisely, we first show that the proposed formalism enables to reduce the pose estimation problem to a position or orientation-only estimation problem in which the remaining unknowns can be derived in closed-form. Then, we demonstrate that it can be further reduced to a 1 Degree-of-Freedom (1DoF) problem and provide the analytical derivations of the pose as a function of that unique scalar unknown. We illustrate our theoretical considerations by visual examples and include a discussion on the practical aspects. Finally, we release this paper along with the corresponding source code in order to contribute towards more efficient resolutions of ellipsoid-related pose estimation problems.

翻译：在计算机视野中,摄像头代表了3D几何实体之间的通信,对图像的预测是一个广泛调查的问题。尽管大多数最先进的数学形式学方法都利用了点或线等低层次原始元素,但近年来出现了非常有效的CNN天体探测器,从而导致使用含有具有语义意义信息的更高层次的特征。在这方面的显微镜工程表明,用椭圆形和椭圆形的2D探测来模拟3D天体,为连接2D和3D数据提供了一种独特的方法。然而,在相关直观学中最经常使用的数学形式学方法无法轻易区分粒子或直径等等等低层次原始体,但近年来出现了非常有效的CNN天线性天体物体探测器,导致某些事态发展中可能有害的特性丧失。此外,预测方程式的线性化过程造成了摄像参数的过度代表,也可能造成效率损失。因此,我们引入了一种独特的理论框架,并展示了它对于2D数据的有益特性。更精确地在相关的直观上,我们首先通过直观的直观讨论来显示,我们所处的直径直观的正向的正向的正向方向的函数显示,从而可以显示我们所处的直径向的直径向的直径向的直径向的直方推推的推的推,从而显示了我们所推的直推的推的推的推的推的推论,从而推论的推论,从而推论的推论,从而的推论可以使最终定的推论的推的推的推的推,从而推,从而推论的推论的推论可以减少了我们所推论的推论,从而推论的推论的推的推的推论,从而推论的推论的推论的推论,从而使得了我们所推的推的推的推的推的推论,从而推的推论,从而推的推的推的推的推论可以使使得的推论,从而推论可以使得的推论,从而的推论,从而推论可以使了我们所推论会使得的推论可以造成的推论,从而的推的推论,从而推论可以使了我们所推论的推论,从而推论的推论,从而推论的推论的推的推论的推论