Understanding the multiple socially-acceptable future behaviors is an essential task for many vision applications. In this paper, we propose a tree-based method, termed as Social Interpretable Tree (SIT), to address this multi-modal prediction task, where a hand-crafted tree is built depending on the prior information of observed trajectory to model multiple future trajectories. Specifically, a path in the tree from the root to leaf represents an individual possible future trajectory. SIT employs a coarse-to-fine optimization strategy, in which the tree is first built by high-order velocity to balance the complexity and coverage of the tree and then optimized greedily to encourage multimodality. Finally, a teacher-forcing refining operation is used to predict the final fine trajectory. Compared with prior methods which leverage implicit latent variables to represent possible future trajectories, the path in the tree can explicitly explain the rough moving behaviors (e.g., go straight and then turn right), and thus provides better interpretability. Despite the hand-crafted tree, the experimental results on ETH-UCY and Stanford Drone datasets demonstrate that our method is capable of matching or exceeding the performance of state-of-the-art methods. Interestingly, the experiments show that the raw built tree without training outperforms many prior deep neural network based approaches. Meanwhile, our method presents sufficient flexibility in long-term prediction and different best-of-$K$ predictions.
翻译:了解社会上可接受的多种未来行为是许多愿景应用的基本任务。 在本文中, 我们提出一种基于树的方法, 称为“ 社会解释树 ” ( SIT ), 以解决这一多模式预测任务, 手工制作的树取决于观测轨迹的先前信息, 以模拟未来的多轨迹。 具体地说, 从树根到叶的一条路径代表了个人未来可能的轨迹。 SIT 使用粗糙到平淡优化战略, 树首先以高端速度建造, 以平衡树的复杂程度和覆盖, 然后以优化贪婪方式鼓励多式联运。 最后, 使用教师推力改进操作来预测最终的细轨迹。 与先前使用隐含的潜在变量来代表未来可能的轨迹。 树的一条路径可以明确解释粗糙的移动行为( 例如, 直走然后右转, ), 从而提供更好的解释性。 尽管手工制作的树, 而在ETH- CY 和 Stefard Dron- train- train- train- train- transviewings reviews reviews 中, 都展示了我们之前的模型方法能够显示许多种方法, 。