Navigating off-road with a fast autonomous vehicle depends on a robust perception system that differentiates traversable from non-traversable terrain. Typically, this depends on a semantic understanding which is based on supervised learning from images annotated by a human expert. This requires a significant investment in human time, assumes correct expert classification, and small details can lead to misclassification. To address these challenges, we propose a method for predicting high- and low-risk terrains from only past vehicle experience in a self-supervised fashion. First, we develop a tool that projects the vehicle trajectory into the front camera image. Second, occlusions in the 3D representation of the terrain are filtered out. Third, an autoencoder trained on masked vehicle trajectory regions identifies low- and high-risk terrains based on the reconstruction error. We evaluated our approach with two models and different bottleneck sizes with two different training and testing sites with a fourwheeled off-road vehicle. Comparison with two independent test sets of semantic labels from similar terrain as training sites demonstrates the ability to separate the ground as low-risk and the vegetation as high-risk with 81.1% and 85.1% accuracy.
翻译:快速自主车辆的越野导航取决于一个强大的认知系统,它能将车辆的轨迹与非可移动地形区分开来。 通常,这取决于对语义的理解,这种理解的基础是从人类专家附加说明的图像中监督地学习。 这需要在人的时间方面投入大量资金,假设正确的专家分类,而细小的细节可能导致分类错误。 为了应对这些挑战,我们提出了一个方法来预测仅从车辆过去的经验中以自我监督的方式预测高低风险地形。首先,我们开发了一种工具,将车辆的轨迹投射到前摄像头图像中。第二,3D地形图示中的隐蔽被过滤。第三,在蒙蔽车辆轨迹区域受过训练的自动编码器根据重建错误确定了低风险和高风险地形。我们用两种模型和不同的瓶盖大小评估了我们的方法,用四轮式越野车辆进行了两个不同的培训和测试地点。比较了与培训地点相似的两套独立的语义标志,显示将地面分为低风险和85 %和高风险植被的能力,81.1和81.1%。