Gait depicts individuals' unique and distinguishing walking patterns and has become one of the most promising biometric features for human identification. As a fine-grained recognition task, gait recognition is easily affected by many factors and usually requires a large amount of completely annotated data that is costly and insatiable. This paper proposes a large-scale self-supervised benchmark for gait recognition with contrastive learning, aiming to learn the general gait representation from massive unlabelled walking videos for practical applications via offering informative walking priors and diverse real-world variations. Specifically, we collect a large-scale unlabelled gait dataset GaitLU-1M consisting of 1.02M walking sequences and propose a conceptually simple yet empirically powerful baseline model GaitSSB. Experimentally, we evaluate the pre-trained model on four widely-used gait benchmarks, CASIA-B, OU-MVLP, GREW and Gait3D with or without transfer learning. The unsupervised results are comparable to or even better than the early model-based and GEI-based methods. After transfer learning, our method outperforms existing methods by a large margin in most cases. Theoretically, we discuss the critical issues for gait-specific contrastive framework and present some insights for further study. As far as we know, GaitLU-1M is the first large-scale unlabelled gait dataset, and GaitSSB is the first method that achieves remarkable unsupervised results on the aforementioned benchmarks. The source code of GaitSSB will be integrated into OpenGait which is available at https://github.com/ShiqiYu/OpenGait.
翻译:Gait 描述个人独特的和有区别的行走模式,并已成为人类识别的最有希望的生物鉴别特征之一。 作为一项细微的识别任务,动作识别很容易受到许多因素的影响,通常需要大量完全附加说明的数据,这些数据成本高且难以满足。 本文提出了一个大规模自我监督的基准,以通过对比学习来显示行走模式,目的是通过提供信息的行走前程和不同的现实世界差异,从大规模、无标签的行走视频中学习用于实际应用的一般行走模式。 具体地说,我们收集了一个大型的、没有标签的坐坐数据集( GaitLU-1M ),由1.02M 的行走序列组成,并提议一个概念上简单但有经验的强大基线模型。 实验性地说,我们根据四种广泛使用的行走标基准,即CASIA-B、OUMLP、GEW和Gait3D, 以及不进行转让学习,对总体行走的行走标和GEI-I的早期模式和GI-M-M-M-M-M-M-M-M-M-M-M-M-M-M-M-M-M-M-M-M-M-M-M-M-M-M-IL-IL-L-S-S-S-S-L-L-S-S-S-S-S-S-S-S-S-S-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-L-L-I-I-L-L-L-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I