In the context of self-driving vehicles there is strong competition between approaches based on visual localisation and LiDAR. While LiDAR provides important depth information, it is sparse in resolution and expensive. On the other hand, cameras are low-cost and recent developments in deep learning mean they can provide high localisation performance. However, several fundamental problems remain, particularly in the domain of uncertainty, where learning based approaches can be notoriously over-confident. Markov, or grid-based, localisation was an early solution to the localisation problem but fell out of favour due to its computational complexity. Representing the likelihood field as a grid (or volume) means there is a trade off between accuracy and memory size. Furthermore, it is necessary to perform expensive convolutions across the entire likelihood volume. Despite the benefit of simultaneously maintaining a likelihood for all possible locations, grid based approaches were superseded by more efficient particle filters and Monte Carlo Localisation (MCL). However, MCL introduces its own problems e.g. particle deprivation. Recent advances in deep learning hardware allow large likelihood volumes to be stored directly on the GPU, along with the hardware necessary to efficiently perform GPU-bound 3D convolutions and this obviates many of the disadvantages of grid based methods. In this work, we present a novel CNN-based localisation approach that can leverage modern deep learning hardware. By implementing a grid-based Markov localisation approach directly on the GPU, we create a hybrid CNN that can perform image-based localisation and odometry-based likelihood propagation within a single neural network. The resulting approach is capable of outperforming direct pose regression methods as well as state-of-the-art localisation systems.
翻译:在自驾驶车辆的背景下,基于视觉本地化和LiDAR的方法之间有着强烈的竞争。 虽然LiDAR提供了重要的深度信息,但这种信息在分辨率上很少,而且很昂贵。另一方面,照相机成本低,而深层学习的最新发展意味着它们能够提供较高的本地化性能。然而,仍然存在一些根本性问题,特别是在不确定性领域,基于学习的方法可能臭名昭著地过于自信。Markov或基于网格的本地化是解决本地化问题的一个早期解决方案,但由于其计算复杂性,这种方法也变得偏好。将可能性字段作为网格(或体积)代表起来,这意味着精确度和记忆大小之间的交易。此外,有必要在整个可能性范围内进行昂贵的拼凑。尽管可以同时维持所有可能的地点的可能性,但基于网络的方法被更高效的粒子过滤器和蒙特卡洛本地化(MCL)所取代。然而,MCL(ML)提出了基于本地基础的问题,例如粒子淡化。 深层次的硬件的进步使得大量的可能性直接储存在GPU上,同时进行内部的硬化,从而有效地运用GPO-DReval化。我们现有的硬化方法。