In head-to-head racing, performing tightly constrained, but highly rewarding maneuvers, such as overtaking, require an accurate model of interactive behavior of the opposing target vehicle (TV). However, such information is not typically made available in competitive scenarios, we therefore propose to construct a prediction and uncertainty model given data of the TV from previous races. In particular, a one-step Gaussian Process (GP) model is trained on closed-loop interaction data to learn the behavior of a TV driven by an unknown policy. Predictions of the nominal trajectory and associated uncertainty are rolled out via a sampling-based approach and are used in a model predictive control (MPC) policy for the ego vehicle in order to intelligently trade-off between safety and performance when attempting overtaking maneuvers against a TV. We demonstrate the GP-based predictor in closed loop with the MPC policy in simulation races and compare its performance against several predictors from literature. In a Monte Carlo study, we observe that the GP-based predictor achieves similar win rates while maintaining safety in up to 3x more races. We finally demonstrate the prediction and control framework in real-time on hardware experiments.
翻译:在头对头的赛跑中,进行严格限制但大有回报的动作,如超载,需要有对立目标飞行器(TV)的精确互动行为模式。然而,这种信息通常不是在竞争情景中提供的,因此我们提议建立一个预测和不确定模式,根据以前种族的电视数据来设计电视。特别是,一个单步高山进程(GP)模型在模拟赛事中与MPC政策密闭循环中展示GP预测器,并将其性能与文献中的若干预测器进行比较。在蒙特卡洛的一项研究中,我们观察到GP预测器在维持最多三轮赛的安全的同时,取得了类似的赢率。我们最后展示了实时硬件实验的预测和控制框架。