In this paper, we present the Multi-view Extended Videos with Identities (MEVID) dataset for large-scale, video person re-identification (ReID) in the wild. To our knowledge, MEVID represents the most-varied video person ReID dataset, spanning an extensive indoor and outdoor environment across nine unique dates in a 73-day window, various camera viewpoints, and entity clothing changes. Specifically, we label the identities of 158 unique people wearing 598 outfits taken from 8, 092 tracklets, average length of about 590 frames, seen in 33 camera views from the very large-scale MEVA person activities dataset. While other datasets have more unique identities, MEVID emphasizes a richer set of information about each individual, such as: 4 outfits/identity vs. 2 outfits/identity in CCVID, 33 viewpoints across 17 locations vs. 6 in 5 simulated locations for MTA, and 10 million frames vs. 3 million for LS-VID. Being based on the MEVA video dataset, we also inherit data that is intentionally demographically balanced to the continental United States. To accelerate the annotation process, we developed a semi-automatic annotation framework and GUI that combines state-of-the-art real-time models for object detection, pose estimation, person ReID, and multi-object tracking. We evaluate several state-of-the-art methods on MEVID challenge problems and comprehensively quantify their robustness in terms of changes of outfit, scale, and background location. Our quantitative analysis on the realistic, unique aspects of MEVID shows that there are significant remaining challenges in video person ReID and indicates important directions for future research.
翻译:在本文中,我们为野外大型视频人重新识别(ReID)提供了多视扩展视频(MEVID)数据集。据我们所知,MEVID代表了最多样化视频人 ReID数据集,覆盖了在73天窗口中的9个独特日期的广泛的室内和室外环境,覆盖了73天窗口、各种相机视角和实体服装变化。具体地说,我们标记了158个独特的人的身份,他们身着598件服装,取自8,092个轨道,平均长度约590个框架,取自大型MEVA人活动数据集的33个相机视图。尽管其他数据集具有更独特的身份,但MEVID强调的是一套关于每个人的更丰富信息,例如:4件制服/身份相对于CCVID的2件制服/身份,17个地点的33个视角,比5个模拟地点的6个视频位置,1 000万框架相对于LS-VID的300万框架。基于MEVA视频数据集,我们还继承了未来目标数据,而我们刻意要从人口统计学角度对数字定位进行快速跟踪分析,让我们的多层次观察。