Occluded person re-identification (Re-ID) is a challenging problem due to the destruction of occluders. Most existing methods focus on visible human body parts through some prior information. However, when complementary occlusions occur, features in occluded regions can interfere with matching, which affects performance severely. In this paper, different from most previous works that discard the occluded region, we propose a Feature Completion Transformer (FCFormer) to implicitly complement the semantic information of occluded parts in the feature space. Specifically, Occlusion Instance Augmentation (OIA) is proposed to simulates real and diverse occlusion situations on the holistic image. These augmented images not only enrich the amount of occlusion samples in the training set, but also form pairs with the holistic images. Subsequently, a dual-stream architecture with a shared encoder is proposed to learn paired discriminative features from pairs of inputs. Without additional semantic information, an occluded-holistic feature sample-label pair can be automatically created. Then, Feature Completion Decoder (FCD) is designed to complement the features of occluded regions by using learnable tokens to aggregate possible information from self-generated occluded features. Finally, we propose the Cross Hard Triplet (CHT) loss to further bridge the gap between complementing features and extracting features under the same ID. In addition, Feature Completion Consistency (FC$^2$) loss is introduced to help the generated completion feature distribution to be closer to the real holistic feature distribution. Extensive experiments over five challenging datasets demonstrate that the proposed FCFormer achieves superior performance and outperforms the state-of-the-art methods by significant margins on occluded datasets.
翻译:隐蔽的人重新身份( Re- ID) 是一个挑战性的问题, 原因是 occluder 的破坏。 多数现有方法都通过某些先前的信息关注可见的人体器官部分。 但是, 当出现互补的封闭性能时, 隐蔽区域的特征会干扰匹配, 这会严重影响性能。 在本文中, 不同于先前丢弃隐蔽区域的大部分工程, 我们建议使用一个“ 功能完成变异器( FFCFormer) ” 来隐含地补充功能空间内隐蔽部分的语义信息。 具体地说, 隐蔽性增强性能( OIIA) 是要模拟整体图像上真实和多样化的隐蔽性能。 这些增强的图像不仅能丰富了培训集聚变异性样本的数量, 而且还会与整体图像形成配对。 随后, 我们建议使用一个带有共同编码的双流结构来学习投入的配对式的配对性特征。 如果没有额外的语义信息, 可以自动生成一个隐隐蔽性特征添加的样本配对 。 然后, 将精度变缩缩缩缩缩缩化的功能在 Clodeal- dreal decreal dreal develdeal develdealdal drefolde 。</s>