Modern robots require accurate forecasts to make optimal decisions in the real world. For example, self-driving cars need an accurate forecast of other agents' future actions to plan safe trajectories. Current methods rely heavily on historical time series to accurately predict the future. However, relying entirely on the observed history is problematic since it could be corrupted by noise, have outliers, or not completely represent all possible outcomes. To solve this problem, we propose a novel framework for generating robust forecasts for robotic control. In order to model real-world factors affecting future forecasts, we introduce the notion of an adversary, which perturbs observed historical time series to increase a robot's ultimate control cost. Specifically, we model this interaction as a zero-sum two-player game between a robot's forecaster and this hypothetical adversary. We show that our proposed game may be solved to a local Nash equilibrium using gradient-based optimization techniques. Furthermore, we show that a forecaster trained with our method performs 30.14% better on out-of-distribution real-world lane change data than baselines.
翻译:现代机器人需要准确的预测结果,以在现实世界中做出最佳决策。例如,自动驾驶汽车需要准确预测其他行动主体的未来行动,以规划安全的轨迹。目前的方法主要依赖历史时间序列对未来进行准确预测。然而,仅依赖观察到的历史数据是有问题的,因为它可能会受到噪声、异常值或不能完全表征所有可能结果的影响。为了解决这个问题,我们提出了一个用于生成机器人控制的稳健预测的新框架。为了模拟影响未来预测的现实世界因素,我们引入了对手的概念,这个对手通过扰动观察到的历史时间序列来增加机器人的最终控制成本。具体来说,我们将这种交互建模为机器人预测者与这个假想对手之间的零和二人博弈。我们表明,可以使用基于梯度的优化技术将我们提出的游戏解决为局部纳什平衡。此外,我们表明,通过我们的方法训练的预测器在处理真实世界的车道变换数据时比基线表现提高了30.14%。