Recognition and segmentation of objects in images enjoy the wealth of large volume of well annotated data. At the other end, when dealing with the reconstruction of geometric structures of objects from images, there is a limited amount of accurate data available for supervised learning. One type of such geometric data with insufficient amount required for deep learning is real world accurate RGB-D images. The lack of accurate RGB-D datasets is one of the obstacles in the evolution of geometric scene reconstructions from images. One solution to creating such a dataset is to capture RGB images while simultaneously using an accurate depth scanning device that assigns a depth value to each pixel. A major challenge in acquiring such ground truth data is the accurate alignment between the RGB images and the measured depth and color profiles. We introduce a differential optimization method that aligns a colored point cloud to a given color image via iterative geometric and color matching. The proposed method enables the construction of RGB-D datasets for specific camera systems. In the suggested framework, the optimization minimizes the difference between the colors of the image pixels and the corresponding colors of the projected points to the camera plane. We assume that the colors produced by the geometric scanner camera and the color camera sensor are different and thus are characterized by different chromatic acquisition properties. We align the different color spaces while compensating for their corresponding color appearance. Under this setup, we find the transformation between the camera image and the point cloud colors by iterating between matching the relative location of the point cloud and matching colors. The successful alignments produced by the proposed method are demonstrated on both synthetic data with quantitative evaluation and real world scenes with qualitative results.
翻译:图像中天体的识别和分解拥有大量高清晰度数据的丰富。 在另一端, 当处理图像对象的几何结构的重建时, 可用于监督学习的准确数据数量有限。 深度学习所需数量不足的某类几何数据是真实的、 准确的 RGB- D 图像。 缺乏准确的 RGB- D 数据集是图像中几何场景重建进化的一个障碍。 创建此数据集的一个解决方案是捕获 RGB- D 图像, 同时使用精确的深度扫描设备给每个像素分配一个深度值。 获取这种地面真实数据的重大挑战是 RGB 图像与测量的深度和颜色剖析所需的准确性数据。 我们引入了一种差异性优化方法, 将彩色点云与给定的彩色图像重建。 在建议的框架中, 优化将图像象素的颜色与所预测值的颜色颜色之间的差别缩小到每个像标定点之间的差别。 我们以不同颜色的色标定的颜色, 以不同的摄像头显示不同的颜色, 我们以不同的颜色来绘制不同的颜色, 以不同的摄像头的颜色来测量。 我们以不同的颜色来, 以不同的颜色来显示不同的摄像师的颜色, 以不同的摄像师的颜色来测量的颜色来测量的颜色来显示。