Motion forecasting plays a significant role in various domains (e.g., autonomous driving, human-robot interaction), which aims to predict future motion sequences given a set of historical observations. However, the observed elements may be of different levels of importance. Some information may be irrelevant or even distracting to the forecasting in certain situations. To address this issue, we propose a generic motion forecasting framework (named RAIN) with dynamic key information selection and ranking based on a hybrid attention mechanism. The general framework is instantiated to handle multi-agent trajectory prediction and human motion forecasting tasks, respectively. In the former task, the model learns to recognize the relations between agents with a graph representation and to determine their relative significance. In the latter task, the model learns to capture the temporal proximity and dependency in long-term human motions. We also propose an effective double-stage training pipeline with an alternating training strategy to optimize the parameters in different modules of the framework. We validate the framework on both synthetic simulations and motion forecasting benchmarks in different domains, demonstrating that our method not only achieves state-of-the-art forecasting performance, but also provides interpretable and reasonable hybrid attention weights.
翻译:运动预测在不同领域(如自主驱动、人-机器人互动)起着重要作用,其目的是根据一系列历史观察预测未来运动序列,但观察到的因素可能具有不同的重要性,某些信息可能与预测在某些情况下的预测无关,甚至分散了注意力。为解决这一问题,我们提议了一个通用运动预测框架(名为RAIN),在混合关注机制的基础上进行动态关键信息选择和排序。一般框架分别用于处理多试剂轨道预测和人类运动预测任务。在前一项任务中,模型学会识别具有图表代表的物剂之间的关系并确定其相对重要性。在后一项任务中,模型学会掌握长期人类运动的时间接近性和依赖性。我们还提议一个有效的双阶段培训管道,同时采用交替培训战略,优化框架不同模块的参数。我们验证了合成模拟框架和不同领域移动预测基准的框架,表明我们的方法不仅实现了最新预测性业绩,而且还提供了可解释和合理的混合关注权重。