Camera localization, i.e., camera pose regression, represents an important task in computer vision since it has many practical applications such as in the context of intelligent vehicles and their localization. Having reliable estimates of the regression uncertainties is also important, as it would allow us to catch dangerous localization failures. In the literature, uncertainty estimation in Deep Neural Networks (DNNs) is often performed through sampling methods, such as Monte Carlo Dropout (MCD) and Deep Ensemble (DE), at the expense of undesirable execution time or an increase in hardware resources. In this work, we considered an uncertainty estimation approach named Deep Evidential Regression (DER) that avoids any sampling technique, providing direct uncertainty estimates. Our goal is to provide a systematic approach to intercept localization failures of camera localization systems based on DNNs architectures, by analyzing the generated uncertainties. We propose to exploit CMRNet, a DNN approach for multi-modal image to LiDAR map registration, by modifying its internal configuration to allow for extensive experimental activity on the KITTI dataset. The experimental section highlights CMRNet's major flaws and proves that our proposal does not compromise the original localization performances but also provides, at the same time, the necessary introspection measures that would allow end-users to act accordingly.
翻译:相机本地化,即相机回归,是计算机视觉中的一项重要任务,因为它有许多实际应用,如智能车辆及其本地化。对回归不确定性的可靠估计也很重要,因为这将使我们能够捕捉危险的本地化失败。在文献中,深神经网络(DNNS)的不确定性估计往往通过取样方法进行,如蒙特卡洛脱落(MCD)和深成形(DE),以牺牲不受欢迎的执行时间或增加硬件资源为代价。在这项工作中,我们考虑了称为深显回归(DER)的不确定性估计方法,以避免任何取样技术,提供直接的不确定性估计。我们的目标是通过分析生成的不确定性,提供系统的方法,以截取基于DNNS结构的相机本地化系统本地化故障。我们提议利用CMRNet(MNet)和深成像(DNN)的多模式图像注册,修改内部配置,以便能够在KITTI数据集上开展广泛的实验活动。实验部分强调CMRNet的主要缺陷,提供直接的不确定性估计。我们的目标是通过分析生成的原始操作,并证明我们的原始操作方法不会损害当地原始操作。</s>