The future motion of traffic participants is inherently uncertain. To plan safely, therefore, an autonomous agent must take into account multiple possible outcomes and prioritize them. Recently, this problem has been addressed with generative neural networks. However, most generative models either do not learn the true underlying trajectory distribution reliably, or do not allow likelihoods to be associated with predictions. In our work, we model motion prediction directly as a density estimation problem with a normalizing flow between a noise sample and the future motion distribution. Our model, named FloMo, allows likelihoods to be computed in a single network pass and can be trained directly with maximum likelihood estimation. Furthermore, we propose a method to stabilize training flows on trajectory datasets and a new data augmentation transformation that improves the performance and generalization of our model. Our method achieves state-of-the-art performance on three popular prediction datasets, with a significant gap to most competing models.
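To make the density-estimation view concrete, the following is a minimal sketch of how a normalizing flow yields an exact likelihood in one pass through the change-of-variables formula, log p(x) = log p_z(f^{-1}(x)) + log |det d f^{-1}/dx|. It is not FloMo's architecture: the per-coordinate affine map, the names `AffineFlow` and `fit_mle`, and the representation of a future trajectory as a flat list of coordinates are all assumptions chosen for illustration. For this toy flow the maximum likelihood fit has a closed form, whereas a real flow would be trained by gradient descent on the negative log-likelihood.

```python
import math


def standard_normal_logpdf(z):
    """Log-density of the standard normal base (noise) distribution at z."""
    return -0.5 * (z * z + math.log(2.0 * math.pi))


class AffineFlow:
    """Toy invertible map x = scale * z + shift, applied per coordinate.

    Hypothetical stand-in for a learned flow; not the FloMo model.
    """

    def __init__(self, scale, shift):
        if any(s == 0.0 for s in scale):
            raise ValueError("scale entries must be non-zero for invertibility")
        self.scale = list(scale)
        self.shift = list(shift)

    def inverse(self, x):
        """Map a trajectory vector x back to the noise sample z."""
        return [(xi - b) / a for xi, a, b in zip(x, self.scale, self.shift)]

    def log_prob(self, x):
        """Exact log-likelihood of x via the change-of-variables formula."""
        z = self.inverse(x)
        log_base = sum(standard_normal_logpdf(zi) for zi in z)
        # Jacobian of the inverse map is diagonal with entries 1 / scale.
        log_det_inverse = -sum(math.log(abs(a)) for a in self.scale)
        return log_base + log_det_inverse

    def sample(self, rng):
        """Draw one trajectory by pushing Gaussian noise through the flow."""
        return [a * rng.gauss(0.0, 1.0) + b
                for a, b in zip(self.scale, self.shift)]


def fit_mle(trajectories):
    """Closed-form maximum likelihood fit of the toy affine flow.

    Each trajectory is assumed to be a flat list of coordinates of equal
    length. The optimum is the per-coordinate mean and (biased) standard
    deviation of the data.
    """
    n = len(trajectories)
    dims = len(trajectories[0])
    shift = [sum(t[d] for t in trajectories) / n for d in range(dims)]
    scale = [
        math.sqrt(sum((t[d] - shift[d]) ** 2 for t in trajectories) / n)
        for d in range(dims)
    ]
    return AffineFlow(scale, shift)
```

Given a fitted flow, `flow.log_prob(candidate)` ranks candidate futures by likelihood, which is the property the abstract highlights as missing from many generative predictors.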