Motion planning for safe autonomous driving requires learning how the environment around an ego-vehicle evolves with time. Ego-centric perception of driveable regions in a scene not only changes with the motion of actors in the environment, but also with the movement of the ego-vehicle itself. Self-supervised representations proposed for large-scale planning, such as ego-centric freespace, confound these two motions, making the representation difficult to use for downstream motion planners. In this paper, we use geometric occupancy as a natural alternative to view-dependent representations such as freespace. Occupancy maps naturally disentangle the motion of the environment from the motion of the ego-vehicle. However, one cannot directly observe the full 3D occupancy of a scene (due to occlusion), making it difficult to use as a signal for learning. Our key insight is to use differentiable raycasting to "render" future occupancy predictions into future LiDAR sweep predictions, which can be compared with ground-truth sweeps for self-supervised learning. The use of differentiable raycasting allows occupancy to emerge as an internal representation within the forecasting network. In the absence of groundtruth occupancy, we quantitatively evaluate the forecasting of raycasted LiDAR sweeps and show improvements of upto 15 F1 points. For downstream motion planners, where emergent occupancy can be directly used to guide non-driveable regions, this representation relatively reduces the number of collisions with objects by up to 17% as compared to freespace-centric motion planners.
翻译:安全自主驾驶的动态规划需要学习自我车辆周围的环境如何随着时间而演变。 自我车辆周围的环境如何演变。 对可驱动区域在一场景中的偏重感不仅随着环境行为者的动作而变化,而且随着自我车辆本身的移动而变化。 为大规模规划(如以自我为中心的自由空间)提议自我监督的演示,混淆了这两个动作,使得下游运动规划者难以使用这些动作的表达方式。 在本文中,我们使用几何占用作为自然的替代方法,取代自由空间等依赖视觉的表达方式。 自我飞行器的运动地图自然将环境运动与环境运动与自我车辆运动分解。然而,人们无法直接观察3D对一个场景的完全占用情况(由于闭塞,因此难以用作学习信号)。 我们的主要洞察力是使用不同的光线将未来占用预测结果“ 重新显示” 进入下游运动规划者。 与可地面巡勘测的自我监督学习相比, 使用不同式的对环境物体的移动物体运动的移动自然分解。 然而, 使用不易对17天线使空间的占用可以直接观察到, 进行内部预测, 进行内部预测, 进行我们用来对地平流, 进行内部预测。