Autonomous driving systems need to handle complex scenarios such as lane following, avoiding collisions, taking turns, and responding to traffic signals. In recent years, approaches based on end-to-end behavioral cloning have demonstrated remarkable performance in point-to-point navigational scenarios, using a realistic simulator and standard benchmarks. Offline imitation learning is readily available, as it does not require expensive hand annotation or interaction with the target environment, but it is difficult to obtain a reliable system. In addition, existing methods have not specifically addressed the learning of reaction for traffic lights, which are a rare occurrence in the training datasets. Inspired by the previous work on multi-task learning and attention modeling, we propose a novel multi-task attention-aware network in the conditional imitation learning (CIL) framework. This does not only improve the success rate of standard benchmarks, but also the ability to react to traffic lights, which we show with standard benchmarks.
翻译:自动驾驶系统需要处理复杂的情景,如车道跟踪、避免碰撞、轮流和对交通信号作出反应。近年来,基于端到端行为克隆的做法在点到点导航情景中表现显著,使用了现实的模拟器和标准基准。离线模拟学习很容易获得,因为它不需要昂贵的手记或与目标环境互动,但很难获得一个可靠的系统。此外,现有方法没有具体解决交通灯反应的学习问题,而这是培训数据集中罕见的。在以往多任务学习和注意力建模工作启发下,我们提议在有条件的模仿学习(CIL)框架内建立一个新的多任务关注网络。这不仅提高了标准基准的成功率,而且提高了对交通灯作出反应的能力,我们用标准基准表明了这一点。