For a self-driving car to operate reliably, its perceptual system must generalize to the end-user's environment -- ideally without additional annotation efforts. One potential solution is to leverage unlabeled data (e.g., unlabeled LiDAR point clouds) collected from the end-users' environments (i.e. target domain) to adapt the system to the difference between training and testing environments. While extensive research has been done on such an unsupervised domain adaptation problem, one fundamental problem lingers: there is no reliable signal in the target domain to supervise the adaptation process. To overcome this issue we observe that it is easy to collect unsupervised data from multiple traversals of repeated routes. While different from conventional unsupervised domain adaptation, this assumption is extremely realistic since many drivers share the same roads. We show that this simple additional assumption is sufficient to obtain a potent signal that allows us to perform iterative self-training of 3D object detectors on the target domain. Concretely, we generate pseudo-labels with the out-of-domain detector but reduce false positives by removing detections of supposedly mobile objects that are persistent across traversals. Further, we reduce false negatives by encouraging predictions in regions that are not persistent. We experiment with our approach on two large-scale driving datasets and show remarkable improvement in 3D object detection of cars, pedestrians, and cyclists, bringing us a step closer to generalizable autonomous driving.
翻译:为了使自主驾驶汽车能够可靠运行,其感知系统必须能够推广到最终用户的环境中,最好是不需要额外的注释工作。一个潜在的解决方案是利用从最终用户环境中收集的未标记数据(例如未标记的LiDAR点云)来适应系统到训练和测试环境之间的差异,而无需额外的标注。尽管之前对这种无监督领域适应问题进行了广泛的研究,但仍有一个根本问题存在:在目标领域中没有可靠的信号来监督适应过程。为了克服这个问题,我们观察到从多次遍历的重复路线收集无监督数据非常容易。虽然这与传统的无监督领域适应有所不同,但这个假设非常现实,因为许多司机共享同样的道路。我们展示了这个简单的额外假设足以产生有效信号,使我们能够在目标领域上执行迭代自我训练的3D物体检测器。具体来说,我们使用域外检测器生成伪标签,但通过删除在多次遍历中持续存在的被认为是移动对象的检测来减少误报。此外,我们通过鼓励在不持久的区域中进行预测来减少误报漏报。我们在两个大型驾驶数据集上进行实验,展示了汽车、行人和骑自行车者的3D物体检测方面的显着提高,为我们实现自适应无监督驾驶带来了一步之遥。