Camera, and associated with its objects within the field of view, localization could benefit many computer vision fields, such as autonomous driving, robot navigation, and augmented reality (AR). In this survey, we first introduce specific application areas and the evaluation metrics for camera localization pose according to different sub-tasks (learning-based 2D-2D task, feature-based 2D-3D task, and 3D-3D task). Then, we review common methods for structure-based camera pose estimation approaches, absolute pose regression and relative pose regression approaches by critically modelling the methods to inspire further improvements in their algorithms such as loss functions, neural network structures. Furthermore, we summarise what are the popular datasets used for camera localization and compare the quantitative and qualitative results of these methods with detailed performance metrics. Finally, we discuss future research possibilities and applications.
翻译:摄影机及其在视野范围内的物体,本地化可使许多计算机视觉领域受益,如自主驱动、机器人导航和扩大现实(AR ) 。 在本次调查中,我们首先引入具体的应用领域和照相机本地化评价指标,根据不同的子任务(基于2D-2D学习的任务、基于2D-3D特性的任务和3D-3D的任务)构成。 然后,我们审查基于结构的照相机的共同方法,提出估计方法、绝对回归和相对回归方法,通过批判性建模方法,激励其算法的进一步改进,如损失功能、神经网络结构等。此外,我们总结了用于相机本地化的流行数据集,并将这些方法的数量和质量结果与详细的性能指标进行比较。最后,我们讨论了未来的研究可能性和应用。