In head-to-head racing, an accurate model of interactive behavior of the opposing target vehicle (TV) is required to perform tightly constrained, but highly rewarding maneuvers such as overtaking. However, such information is not typically made available in competitive scenarios, we therefore propose to construct a prediction and uncertainty model given data of the TV from previous races. In particular, a one-step Gaussian process (GP) model is trained on closed-loop interaction data to learn the behavior of a TV driven by an unknown policy. Predictions of the nominal trajectory and associated uncertainty are rolled out via a sampling-based approach and are used in a model predictive control (MPC) policy for the ego vehicle in order to intelligently trade-off between safety and performance when attempting overtaking maneuvers against a TV. We demonstrate the GP-based predictor in closed loop with the MPC policy in simulation races and compare its performance against several predictors from literature. In a Monte Carlo study, we observe that the GP-based predictor achieves similar win rates while maintaining safety in up to 3x more races. We finally demonstrate the prediction and control framework in real-time in a experimental study on a 1/10th scale racecar platform operating at speeds of around 2.8 m/s, and show a significant level of improvement when using the GP-based predictor over a baseline MPC predictor. Videos of the hardware experiments can be found at https://youtu.be/KMSs4ofDfIs.
翻译:在头对头赛中,对立目标飞行器(TV)的准确互动行为模式需要严格约束,但高回报性强的动作,如超额操作等。然而,这种信息通常不是在竞争情景中提供的,因此我们提议建立一个预测和不确定性模型,给电视台提供以往赛事的数据。特别是,对闭路互动数据进行了一次性Gaussian流程(GP)模式的培训,以了解由未知政策驱动的电视行为。对名义轨迹和相关不确定性的预测是通过抽样方法推出的,并用于自我驱动飞行器的模型预测控制(MPC)政策,以便在试图对电视过度操作时,在安全和性能之间进行明智的交换。我们在模拟赛事时,与MPC政策进行闭路的预测,将其性能与数个预测器进行比较。在蒙特卡洛的一项研究中,我们观察到GPY预测和相关的不确定性预测值和相关不确定性,同时保持最高至3x的种族的安全性能。我们最后展示了GV4的预测和性能之间的平衡,在实际10年的实验性水平上,我们展示了一次实验级的预测和控制水平上,同时展示了一次实验级的轨道/10级的进度。</s>