The waterdrops on windshields during driving can cause severe visual obstructions, which may lead to car accidents. Meanwhile, the waterdrops can also degrade the performance of a computer vision system in autonomous driving. To address these issues, we propose an attention-based framework that fuses the spatio-temporal representations from multiple frames to restore visual information occluded by waterdrops. Due to the lack of training data for video waterdrop removal, we propose a large-scale synthetic dataset with simulated waterdrops in complex driving scenes on rainy days. To improve the generality of our proposed method, we adopt a cross-modality training strategy that combines synthetic videos and real-world images. Extensive experiments show that our proposed method can generalize well and achieve the best waterdrop removal performance in complex real-world driving scenes.
翻译:驾驶期间挡风玻璃上的水滴可造成严重的视觉障碍,可能导致汽车事故。与此同时,水滴也可降低自动驾驶时计算机视觉系统的性能。为了解决这些问题,我们提议了一个关注框架,将孔-时表从多个框架结合起来,以恢复被水滴所覆盖的视觉信息。由于缺乏关于删除视频水滴的培训数据,我们提议建立一个大型合成数据集,在雨季复杂的驾驶场上模拟水滴。为了改进我们拟议方法的普遍性,我们采取了一种将合成视频与现实世界图像结合起来的跨模式培训战略。广泛的实验表明,我们拟议的方法可以很好地推广,在复杂的现实世界驾驶场上实现最佳的清除水滴功能。