The process of designing costmaps for off-road driving tasks is often a challenging and engineering-intensive task. Recent work in costmap design for off-road driving focuses on training deep neural networks to predict costmaps from sensory observations using corpora of expert driving data. However, such approaches are generally subject to over-confident mispredictions and are rarely evaluated in-the-loop on physical hardware. We present an inverse reinforcement learning-based method of efficiently training deep cost functions that are uncertainty-aware. We do so by leveraging recent advances in highly parallel model-predictive control and robotic risk estimation. In addition to demonstrating improvement at reproducing expert trajectories, we also evaluate the efficacy of these methods in challenging off-road navigation scenarios. We observe that our method significantly outperforms a geometric baseline, resulting in 44% improvement in expert path reconstruction and 57% fewer interventions in practice. We also observe that varying the risk tolerance of the vehicle results in qualitatively different navigation behaviors, especially with respect to higher-risk scenarios such as slopes and tall grass.
翻译:设计越野驾驶任务的成本图往往是一项具有挑战性和工程密集性的任务。最近关于越野驾驶的成本图设计工作的重点是培训深神经网络,以利用专家驾驶数据的公司体对感官观测进行预测的成本图。然而,这些方法一般会受到过于自信的误解,很少在物理硬件的现场评估。我们提出了一种反向强化学习法,有效培训具有不确定性的深成本功能。我们利用在高度平行模型预测控制和机器人风险估计方面的最新进展这样做。除了在再生专家轨迹方面显示出改进外,我们还评估了这些方法在挑战越野航行情景方面的效力。我们发现,我们的方法大大超越了几何基线,导致专家路径重建44%的改进,实际干预减少57%。我们还观察到,在质量上不同的导航行为中,特别是斜坡和高草等高风险情形中,车辆的风险容忍度各不相同。