In multiagent environments, several decision-making individuals interact while adhering to the dynamics constraints imposed by the environment. These interactions, combined with the potential stochasticity of the agents' decision-making processes, make such systems complex and interesting to study from a dynamical perspective. Significant research has been conducted on learning models for forward-direction estimation of agent behaviors, for example, pedestrian predictions used for collision-avoidance in self-driving cars. However, in many settings, only sporadic observations of agents may be available in a given trajectory sequence. For instance, in football, subsets of players may come in and out of view of broadcast video footage, while unobserved players continue to interact off-screen. In this paper, we study the problem of multiagent time-series imputation, where available past and future observations of subsets of agents are used to estimate missing observations for other agents. Our approach, called the Graph Imputer, uses forward- and backward-information in combination with graph networks and variational autoencoders to enable learning of a distribution of imputed trajectories. We evaluate our approach on a dataset of football matches, using a projective camera module to train and evaluate our model for the off-screen player state estimation setting. We illustrate that our method outperforms several state-of-the-art approaches, including those hand-crafted for football.
翻译:在多试剂环境中,若干决策人员在遵守环境的动态限制的同时进行互动,这些互动,加上代理人决策程序的潜在随机性,使这种系统变得复杂和有趣,从动态角度进行研究。已经对前向估计代理人行为的学习模式进行了重大研究,例如,在自行驾驶的汽车中,为避免碰撞而使用的行人预测,但在很多情况下,在特定的轨迹序列中只能提供对代理人的零星观测。例如,在足球中,一些球员可能进出播放录像片段,而未观测到的球员则继续在屏幕外进行互动。在本论文中,我们研究了多试剂时间序列包设问题,因为过去和今后对各种制剂的观察都用于估计其他代理人缺失的观察。我们的方法称为“直观图”,使用前向和后向信息模型,以学习编织的轨迹的分布。我们评估了在足球机组外进行模拟的方法,我们用一个项目模型来评估了在足球机组外的模型,我们用几个摄影机组模型来评估。我们用几个摄影机模型来评估了我们的国家。