Image based localization is a classical computer vision challenge, with several well-known datasets. Generally, datasets consist of a visual 3D database that captures the modeled scenery, as well as query images whose 3D pose is to be discovered. Usually the query images have been acquired with a camera that differs from the imaging hardware used to collect the 3D database; consequently, it is hard to acquire accurate ground truth poses between query images and the 3D database. As the accuracy of visual localization algorithms constantly improves, precise ground truth becomes increasingly important. This paper proposes TBPos, a novel large-scale visual dataset for image based positioning, which provides query images with fully accurate ground truth poses: both the database images and the query images have been derived from the same laser scanner data. In the experimental part of the paper, the proposed dataset is evaluated by means of an image-based localization pipeline.
翻译:以图像为基础的本地化是一个典型的计算机视觉挑战,它有若干众所周知的数据集。 一般来说, 数据集包括一个视觉的 3D 数据库, 收集模型外景, 以及3D 外形的查询图像。 通常, 查询图像是用与用于收集 3D 数据库的成像硬件不同的相机获得的; 因此, 很难获得查询图像和 3D 数据库之间的准确地面真象。 由于视觉本地化算法的准确性不断提高, 精确的地面真象变得日益重要 。 本文提出TBPos, 是一个用于基于图像的定位的新型大型视觉数据集, 提供具有完全准确地面真象的查询图像: 数据库图像和查询图像都是从相同的激光扫描器数据中衍生出来的。 在本文的实验部分, 提议的数据集是通过基于图像的本地化管道来评估的。