Current multi-person localisation and tracking systems have an over reliance on the use of appearance models for target re-identification and almost no approaches employ a complete deep learning solution for both objectives. We present a novel, complete deep learning framework for multi-person localisation and tracking. In this context we first introduce a light weight sequential Generative Adversarial Network architecture for person localisation, which overcomes issues related to occlusions and noisy detections, typically found in a multi person environment. In the proposed tracking framework we build upon recent advances in pedestrian trajectory prediction approaches and propose a novel data association scheme based on predicted trajectories. This removes the need for computationally expensive person re-identification systems based on appearance features and generates human like trajectories with minimal fragmentation. The proposed method is evaluated on multiple public benchmarks including both static and dynamic cameras and is capable of generating outstanding performance, especially among other recently proposed deep neural network based approaches.
翻译:目前多人定位和跟踪系统过度依赖使用外观模型进行目标再识别,几乎没有办法为这两个目标采用完全的深层次学习解决办法。我们为多人定位和跟踪提出了一个新的、完整的深层次学习框架。在这方面,我们首先引入了光权分级结构,其次为人定位,它克服了典型在多人环境中发现的与隔离和噪音探测有关的问题。在拟议的跟踪框架内,我们利用行人轨迹预测方法的最新进展,并根据预测轨迹提出了新的数据关联计划。这消除了基于外观特征计算成本昂贵的人再识别系统的需求,并生成了像人类轨迹一样的极小的碎片。拟议方法根据多个公共基准进行评估,包括静态和动态相机,并且能够产生杰出的性能,特别是在最近提出的其他深神经网络方法中。