The reconstruction of a scene via a stereo-camera system is a two-steps process, where at first images from different cameras are matched to identify the set of point-to-point correspondences that then will actually be reconstructed in the three dimensional real world. The performance of the system strongly relies of the calibration procedure, which has to be carefully designed to guarantee optimal results. We implemented three different calibration methods and we compared their performance over 19 datasets. We present the experimental evidence that, due to the image noise, a single set of parameters is not sufficient to achieve high accuracy in the identification of the correspondences and in the 3D reconstruction at the same time. We propose to calibrate the system twice to estimate two different sets of parameters: the one obtained by minimizing the reprojection error that will be used when dealing with quantities defined in the 2D space of the cameras, and the one obtained by minimizing the reconstruction error that will be used when dealing with quantities defined in the real 3D world.
翻译:通过立体摄像机系统重建场景是一个两步过程,首先将不同相机的图像与最初的相匹配,以确定在三维现实世界中将实际重建的一组点对点通信。该系统的性能强烈依赖校准程序,而校准程序必须仔细设计,以保证最佳结果。我们采用了三种不同的校准方法,并将其性能比照19个数据集。我们提出了实验证据,证明由于图像噪音,单一的一组参数不足以在同一时间在识别通信和3D重建中达到高度精确度。我们提议对系统进行两次校准,以估计两套不同的参数:一个是尽量减少在处理2D摄像头空间确定的数量时将使用的重新预测错误,另一个是尽量减少在处理实际3D世界确定的数量时将使用的重建错误。