Predicting the future states of surrounding traffic participants and planning a safe, smooth, and socially compliant trajectory accordingly is crucial for autonomous vehicles. There are two major issues with the current autonomous driving system: the prediction module is often decoupled from the planning module and the cost function for planning is hard to specify and tune. To tackle these issues, we propose an end-to-end differentiable framework that integrates prediction and planning modules and is able to learn the cost function from data. Specifically, we employ a differentiable nonlinear optimizer as the motion planner, which takes the predicted trajectories of surrounding agents given by the neural network as input and optimizes the trajectory for the autonomous vehicle, thus enabling all operations in the framework to be differentiable including the cost function weights. The proposed framework is trained on a large-scale real-world driving dataset to imitate human driving trajectories in the entire driving scene and validated in both open-loop and closed-loop manners. The open-loop testing results reveal that the proposed method outperforms the baseline methods across a variety of metrics and delivers planning-centric prediction results, allowing the planning module to output close-to-human trajectories. In closed-loop testing, the proposed method shows the ability to handle complex urban driving scenarios and robustness against the distributional shift that imitation learning methods suffer from. Importantly, we find that joint training of planning and prediction modules achieves better performance than planning with a separate trained prediction module in both open-loop and closed-loop tests. Moreover, the ablation study indicates that the learnable components in the framework are essential to ensure planning stability and performance.
翻译:预测交通参与者的未来状况,并规划一个安全、顺畅和符合社会要求的轨道,这对自主车辆至关重要。当前自主驾驶系统有两个主要问题:预测模块往往与规划模块脱钩,规划的成本功能很难具体和调和。为了解决这些问题,我们提议了一个端到端的不同框架,将预测和规划模块结合起来,并能够从数据中学习成本功能。具体地说,我们使用一个可区分的非线性优化工具,作为运动的开放模块,它将神经网络作为输入和优化自动驾驶工具的轨道提供的周围代理器预测轨迹,从而使自动驾驶工具的轨道轨迹与规划模块脱钩,从而使框架中的所有业务都能够区分开来,包括成本函数权重。为了解决这些问题,我们建议的框架将一个大型真实世界驱动数据集纳入预测和规划模块,并在整个驱动场景场上以开放和闭路模式的方式验证成本功能。 开放操作测试结果表明,拟议的方法比各种计量和经过培训的流程更精细的周期框架的基线方法要优于各种计量和经过精细的流程规划流程。 运行模型显示,我们使用更精细的流程的流程的流程的流程的流程路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路