Person re-identification (re-ID) plays an important role in applications such as public security and video surveillance. Recently, learning from synthetic data, which benefits from the popularity of synthetic data engine, has attracted great attention from the public eyes. However, existing datasets are limited in quantity, diversity and realisticity, and cannot be efficiently used for re-ID problem. To address this challenge, we manually construct a large-scale person dataset named FineGPR with fine-grained attribute annotations. Moreover, aiming to fully exploit the potential of FineGPR and promote the efficient training from millions of synthetic data, we propose an attribute analysis pipeline called AOST, which dynamically learns attribute distribution in real domain, then eliminates the gap between synthetic and real-world data and thus is freely deployed to new scenarios. Experiments conducted on benchmarks demonstrate that FineGPR with AOST outperforms (or is on par with) existing real and synthetic datasets, which suggests its feasibility for re-ID task and proves the proverbial less-is-more principle. Our synthetic FineGPR dataset is publicly available at https://github.com/JeremyXSC/FineGPR.
翻译:最近,从合成数据引擎受人欢迎的合成数据中学习,引起了公众的极大关注;然而,现有的数据集数量、多样性和现实性有限,无法有效地用于再识别问题;为了应对这一挑战,我们手工建立一个名为FineGPR的大型个人数据集,配有细微分数属性说明;此外,为了充分利用FineGPR的潜力,促进数百万合成数据的有效培训,我们提议了一个称为AOST的属性分析管道,该管道动态地学习真实域的属性分布,然后消除合成数据和现实世界数据之间的差距,从而可以自由地用于新的情景。根据基准进行的实验表明,FineGPR与AOST的超镜形(或接近于)现有真实和合成数据集,表明其重新识别任务的可行性,并证明流传的低比原则。我们的合成GPRPR数据集在https://github.com/JeremyX上公开提供。