Although there is a significant development in 3D Multi-view Multi-person Tracking (3D MM-Tracking), current 3D MM-Tracking frameworks are designed separately for footprint and pose tracking. Specifically, frameworks designed for footprint tracking cannot be utilized in 3D pose tracking, because they directly obtain 3D positions on the ground plane with a homography projection, which is inapplicable to 3D poses above the ground. In contrast, frameworks designed for pose tracking generally isolate multi-view and multi-frame associations and may not be robust to footprint tracking, since footprint tracking utilizes fewer key points than pose tracking, which weakens multi-view association cues in a single frame. This study presents a Unified Multi-view Multi-person Tracking framework to bridge the gap between footprint tracking and pose tracking. Without additional modifications, the framework can adopt monocular 2D bounding boxes and 2D poses as the input to produce robust 3D trajectories for multiple persons. Importantly, multi-frame and multi-view information are jointly employed to improve the performance of association and triangulation. The effectiveness of our framework is verified by accomplishing state-of-the-art performance on the Campus and Shelf datasets for 3D pose tracking, and by comparable results on the WILDTRACK and MMPTRACK datasets for 3D footprint tracking.
翻译:虽然3D多视角多人跟踪(3D MM-Tracking)方面有重大发展,但目前的3D MM-跟踪框架是单独设计用于足迹和跟踪的,具体而言,3D构成跟踪无法使用足迹跟踪框架,因为它们直接在地面上获得3D位置,而地面的同影投影则不适用3D;相比之下,为显示跟踪而设计的框架一般孤立多视角和多框架协会,可能不适于足迹跟踪,因为足迹跟踪使用的关键点少于构成跟踪的关键点,这削弱了多视角关联在单一框架中的信号;本研究报告提出了一个统一多视角多视角多人跟踪框架,以弥合足迹跟踪和跟踪之间的差距;如果不作进一步修改,该框架可采用单镜2D捆绑框,2D作为为多人制作强健的3D轨迹的投入;使用重要、多框架和多视角信息共同用于改善关联和三角的绩效。